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ON  THE  NECESSARY  CONDITION  FOR  OPTIMAL  CONTROL  OF 


NONLINEAR  SYSTEMS 

Hubert  Halkin 


Introduction . 

Words  such  as  "system, "  "control,"  "optimal  control,"  etc.,  have 
recently  become  very  popular  among  a  large  group  of  engineers,  partic¬ 
ularly  in  aero-astronautics  and  electronics,  as  well  as  among  many 
social  science  researchers  in  economics  and  psychology.  The  use  of 
the  same  vocabulary  in  totally  unrelated  fields  of  study  should  not 
surprise  us  when  we  realize  that  they  refer  to  problems  which  have  the 
same  mathematical  structures.  Among  these  mathematical  structures,  two 
are  of  particular  importance:  the  dynamical  system  and  the  control 
system  which  are  defined  as  follows: 

A  dynamical  system  is  a  pair  (Y,R)  where  Y  is  an  arbitrary 
space,  called  the  event  space,  and  where  R  is  a  binary  relation  on  Y 
such  that 


(i) 

aRa 

for 

all 

a  e  Y,  i.e.,  R  is  reflexive 

(ii) 

aRb 

and 

bRc 

implies  aRc,  i.e.,  R  is  transitive 

(iii) 

aRb 

and 

bRa 

implies  a  =  b,  i.e.,  R  is  antisymmetric 

(iv) 

aRb 

and 

aRc 

implies  either  bRc  or  cRb 

These  properties  of  the  relation  R  correspond  to  the  usual  con¬ 
cepts  of  causality  and  determinism.  We  may  think  of  the  formula  aRb 
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as  meaning:  the  event  b  follows  the  event  a.  This  correspondence 
will  be  seen  clearly  in  the  examples  given  later. 

A  control  system  is  a  triple  (Y,F,R( •))  where  F  is  an  arbitrary 
space,  called  the  strategy  space,  and  where  (Y,R(a))  is  a  dynamical 
system  for  every  a  in  F.  To  clarify  these  ideas  let  us  consider  a 
particular  type  of  dynamical  system  and  the  control  system  corresponding 
to  it. 

First  we  shall  consider  the  dynamical  system  associated  with  a 
system  of  ordinary  differential  equations.  In  such  a  case  the  space  Y 
is  the  Cartesian  product  of  the  real  time  axis  T  with  elements  t 
and  of  the  n-dimensional  Euclidean  space  X,  called  the  state  space, 
with  elements  x  =  (x  ,  ...  ,  x  ).  A  function  f(x,t)  from  X  X  T 
into  X  is  given.  A  binary  relation  R  over  T  X  X  is  then  defined 
by 

(t  ,Xi)  R( tg , Xg )  iff  x(tg,x1,t1)  *  x2  and  t^  < 
where  x(t;x*,t*)  is  the  solution  of  the  differential  system 

x  =  f(x,t)  a.e.  t 

satisfying  the  initial  condition 

x(t*JX*,t#)  -  X* 

We  shall  say  that  this  binary  relation  R  is  generated  over 
T  X  X  by  the  differential  system  x  «  f(x,t).  We  easily  see  that  the 
relation  R  satisfies  the  conditions  (i)  to  (iv)  given  earlier,  which 
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implies  that  (T  X  X,R)  is  a  dynamical  system.  More  generally,  the 
relation  R  could  be  generated  by  a  system  of  difference  equations,  of 
differential-difference  equations,  of  integral  equations,  etc.  The 
relation  R  could  even  be  defined  explicitly  by  an  appropriate  subset 
of  T  X  X  X  T  X  X. 

Let  us  now  consider  the  control  system  associated  with  a  class  of 
systems  of  ordinary  differential  equations.  The  space  Y  is  again  the 
Cartesian  product  of  the  real  time  axis  T  and  of  the  state  space  X. 
The  strategy  space  F  is  given  and  for  each  a  e  F  a  function 
f(x,t;a)  from  X  X  T  into  X  is  given.  The  binary  relation  R(a)  is 
then  generated  over  T  X  X  by  the  differential  system 

x  =  f(x,t;a) 

according  to  the  previous  definition.  The  triple  x  X,F,R(*))  is 
therefore  a  control  system.  We  shall  say  that  (t^x^)  is  reachable 
from  (t^x^  if  ( t1,x^)R(a)( t^,x2)  for  some  a  £  F. 

Physics,  as  most  descriptive  sciences,  is  concerned  with  the  study 
of  dynamical  systems,  whereas  engineering,  economics  and  the  other 
normative  sciences  are  concerned  with  the  study  of  control  systems. 

Classical  mechanics  offers  a  simple  example  of  a  dynamical  system: 
the  set  Y  in  classical  mechanics  is  the  Cartesian  product  of  the  time 
axis  and  of  the  state  space,  and  the  relation  R  is  generated  by  the 
laws  of  mechanics,  which  are  given  in  most  cases  under  the  form  of  a 
set  of  differential  equations .  However,  the  whole  field  of  physics 
cannot  be  reduced  to  a  scheme  of  such  simplicity;  no  Laplace's  observer 
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could  help.  Quantum  mechanics,  for  instance,  is  a  dynamical  system 
where  the  set  Y  is  the  Cartesian  product  of  the  time  axis  and  of 
function  spaces  of  probability  distributions. 

As  an  example  of  a  control  system,  let  us  consider  a  rocket.  A 
rocket  is  a  mechanical  system  equipped  with  regulatory  devices  such  as 
rudders,  thrust  modifiers,  etc.  The  variables  describing  the  position 
of  these  regulatory  devices  are  called  control  variables .  If  these 
control  variables  are  given  functions  of  the  time  and  of  the  state 
variables,  i.e.,  position,  velocity,  etc.,  then  we  have  a  dynamical 
system.  If,  however,  we  are  allowed  to  choose  the  functions  describing 
the  control  variables  in  a  certain  given  class  of  functions  called  the 
strategy  space,  then  we  have  a  control  system. 

It  is  not  difficult  to  imagine  many  examples  of  control  systems 
in  other  areas  of  engineering  and  economics.  It  should  be  noted  that 
a  control  system  could  be  stochastic:  in  many  problems  the  set  Y 
will  be,  as  in  quantum  mechanics,  the  Cartesian  product  of  the  time 
axis  and  function  spaces  of  probability  distributions. 

When  dealing  with  a  dynamical  system  the  essential  question  which 
one  should  ask  is:  how  does  it  behave,  i.e.,  given  a  e  Y  which  are 
the  properties  of  the  set  (b  :  aKb}?  This  question  has  been  the  object 
of  extensive  studies,  especially  the  theory  of  stability  and  oscilla¬ 
tion  of  dynamical  systems  described  by  ordinary  differential  equations . 

In  the  case  of  a  control  system  a  new  type  of  question  may  be 
asked:  what  is  the  ,fbestM  element  of  the  strategy  space?  For  example: 
given  a  rocket  and  initial  and  terminal  points  in  the  state  space,  how 
should  we  choose  the  control  variables  so  that  the  rocket  will  pass 
from  the  initial  to  the  terminal  point  in  the  minimum  amount  of  time? 
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Generally,  we  shall  define  an  optimal  control  problem  as  follows: 
Given 

(1)  a  control  system  (Y,F,R(*)) 

(2)  a  subset  D  of  Y  X  Y 

(3)  a  real  function  g  on  G  =  {(a,b,f)  :  (a,b,f)  e  D  X  F,  aR(f)b) 
where  (a,b,f)  =  {c  :  aR(f)c  and  cR(f)b} 

Find  an  element  (a,b,f)  €  G  such  that  g((a,b,f))  is  maximum.  This 
optimal  control  problem  is  denoted  by  the  quintuple  (Y,F,R( * ),D,g) . 

In  the  theory  of  optimal  control,  as  in  the  theory  of  dynamical 
systems,  it  is  possible  to  obtain  very  interesting  results  when  we 
assume  that  the  relation  R  is  generated  by  the  solutions  of  a  system 
of  differential  equations. 

In  this  work  we  shall  consider  control  systems  (Y,F,R(*))  of 
the  following  type: 

Y  is  the  Cartesian  product  of  the  real  time  axis  T  with  elements 
t  and  of  an  n -dimensional  Euclidean  space  X  with  elements 
X  =  (x  ,  ...  ,  X  ). 

F  is  the  class  of  measurable  r-dimensional  functions  [u]  de¬ 
fined  on  T  and  taking  their  values  in  a  given  set  ft. 

R([u])  for  [u]  €  F  is  generated  by  the  solutions  of  the  system 

x  *  f(x,u(t),t)  a.e.  t 

where  f(x,u,t)  is  a  given  function. 

With  this  particular  type  of  control  system  we  shall  associate  an 
optimal  controLjproblem  (Y,F,R(  ■  ),D,g)  of  the  following  type: 
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D  »  A  X  B  where  A  is  a  set  consisting  of  one  point  of  Y 
and  where  B  is  the  set  consisting  of  one  line  of  Y 
parallel  to  the  axis  x11. 

8«(xa,ta),  (^V*  [«]»  =  x£  . 

In  other  words,  we  are  given  a  point  A  in  X  X  T  and  a  line  B 
in  X  X  T,  parallel  to  the  axis  x11;  how  can  we  find  a  function  [u] 
in  a  given  class  F  such  that  starting  at  A  and  integrating  the 
system  x  =  f(x,u(t),t)  we  would  end  on  B  as  far  as  possible  in  the 
positive  direction  on  x11?  We  call  this  problem  the  fundamental 
problem  of  optimal  control.  This  fundamental  problem  is  stated  in 
greater  detail  in  Section  1. 

This  work  is  principally  devoted  to  the  study  of  the  necessary 
conditions  for  the  solution  of  the  fundamental  problem  of  optimal 
control . 

We  want  to  stress  here  the  fundamental  difference  between  a  clas¬ 
sical  problem  of  calculus  of  variations  and  a  problem  of  optimal  control. 
In  an  optimal  control  problem  the  set  ft  may  be  quite  arbitrary,  and 
due  to  technological  limitations  it  is  very  often  a  bounded  and  closed 
set:  for  instance,  the  thrust  of  a  rocket  can  only  vary  on  the  closed 
interval  [0,m]  where  m  is  the  maximum  available  thrust.  If  a 
classical  problem  of  calculus  of  variations  is  put  in  the  form  of  an 
optimal  control  problem,  the  corresponding  set  ft  is  always  open. 

This  explains  why  the  classical  techniques  of  the  calculus  of  varia¬ 
tions  do  not  work  for  the  general  case  of  optimal  control  problems . 
Indeed,  one  of  the  most  fundamental  cancepts  of  the  calculus  of  varia¬ 
tions  is  the  concept  of  the  arbitrary  variation:  you  compare  a  nominal 
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trajectory  corresponding  to  the  strategy  f  with  the  trajectory  corre¬ 
sponding  to  a  strategy  f  +  8  and  in  calculus  of  variations  this  is 
always  possible  when  5  is  small  enough.  In  optimal  control  this  is 
not  true  anymore:  if  along  a  rocket  trajectory  the  thrust  has  the 
maximum  available  value  ra  at  seme  time,  it  has  no  meaning  to  consider 
comparison  trajectory  where  the  thrust  is  augmented  by  a  positive  5, 
however  small  this  6  could  be. 

The  most  important  result  in  the  theory  of  optimal  control  is  the 
"Maximum  Principle  of  Pontryagin,"  a  generalization  of  the  Weierstrass 
E-test  of  the  classical  calculus  of  variations.  In  this  work  the 
"Maximum  Principle"  is  obtained  by  a  method  fundamentally  different 
from  the  method  of  Pontryagin  and  his  associates,  in  particular  we 
avoid  some  unresolved  topological  difficulties  encountered  in  their 
reasoning.  It  should  be  remarked  also  that  the  assumptions  we  are 
making  in  the  statement  of  our  problem,  in  particular  on  the  differ¬ 
entiability  and  boundedness  of  the  function  f(x,u,t),  are  much  weaker 
than  the  assumptions  made  by  Pontryagin  and  his  associates.  In  a 
previous  publication,  [12],  using  the  same  method,  we  obtained  the 
same  results  for  a  more  restrictive  class  of  problems. 

Any  mathematical  venture  is  made  up  of  two  parts:  geometrical 
intuition  and  analytical  machinery.  From  the  chronological  point  of 
view  the  geometrical  intuition  always  precedes  the  analytical  manipu¬ 
lation  in  the  formation  of  a  theory  and  the  first  is  of  great  help  to 
understand  the  second.  Unfortunately,  this  duality  has  a  marked 
tendency  to  disappear  and  the  role  of  geometrical  intuition  is  barely 
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noticeable  in  the  final  form  of  a  theory.  This  work  is  no  exception 
to  this  rule:  the  analytical  machinery  is  easily  seen.  Besides  clas¬ 
sical  results  of  the  theory  of  ordinary  differential  equations,  we 
use  some  extensions  of  the  results  of  Lyapounov  [16]  and  Blackwell  [3]  ' 
on  the  range  of  a  vector  integral  and  an  application  of  Brouwer's 
Fixed  Point  Theorem.  Unfortunately,  the  geometrical  motivation  is 
virtually  absent  from  this  work.  For  this  reason  we  shall  make  up  for 
this  deficiency  in  the  introduction.  More  precisely,  we  shall  gener¬ 
alize  the  concept  of  propagation  and  show  that  the  fundamental  problem 
of  optimal  control  described  above  can  be  viewed  as  a  problem  of  optimal 
propagation  in  an  abstract  space  X. 

A  standard  problem  of  classical  propagation  theory  has  the 
following  structure:  we  are  given  a  medium  with  a  propagation  law; 
the  medium  is  a  rest  for  t  <  tQ,  we  produce  a  certain  perturbation 
at  time  tQ  and  we  want  to  predict  what  will  happen  for  t  >  tQ. 

This  standard  problem  could  be  considered  on  two  different 
levels.  If  we  want  to  predict  the  intensity  of  the  perturbation  for 
every  element  in  space-time  as  a  function  of  the  intensity  of  the 
initial  perturbation  at  time  t  ,  we  have  what  we  call  a  quantitative 
propagation  problem.  In  seme  circumstances,  however,  it  is  enough 
to  predict  which  elements  in  space-time  could  possibly  be  perturbed 
as  soon  as  we  know  which  points  are  perturbed  at  the  time  tQ.  This 
is  what  we  call  a  qualitative  propagation  problem. 

To  solve  a  quantitative  propagation  problem  we  need  the  concept 
of  intensity  of  a  perturbation  and  a  precise  description  of  the 


8 


space-time  variation  of  this  intensity  which  is  usually  given  hy  a 
partial  differential  equation. 

In  this  work  we  shall  restrict  our  interest  to  qualitative  propa¬ 
gation  problems  and  consider  the  fundamental  problem  of  optimal  control 
as  a  generalization  of  the  qualitative  propagation  problem. 

To  every  element  (x,t)  €  X  X  T  we  shall  associate  the  set 

w(x,t)  =  {f(x,u,t)  :  u  €  n) 

The  set  w(x,t)  will  be  called  the  "wavelet"  at  the  point  x  for  the 
time  t.  The  analogy  with  optics  is  clear:  whenever  a  perturbation  is 
produced  at  the  point  x  at  the  time  t  than,  in  first  approximation, 
all  the  points  of  the  set 

(x  +  a  dt  :  a  e  w(x,t)} 

will  be  perturbed  at  the  r xne  t  +  dt. 

If  we  write  x(t  . aj)  for  the  solution  of 

X  =  f(x,u(t),t) 

satisfying  the  initial  condition  x(t  j[u])  =  x  then 

a  a 

W(t)  *  {x( t;  [u] )  :  [u]  e  F) 

is  the  set  of  points  of  X  which  at  the  time  t  could  possibly  be 

affected  by  a  perturbation  having  taken  place  at  x  =  x  at  the 

61 


9 


time  t  =  t&.  In  other  words,  the  boundary  dw(t)  of  the  set  W(t) 
is  the  wavefront  at  the  time  t  of  a  perturbation  starting  at  x  =  x 

8» 

at  the  time  t  =  t  . 

a 

We  define  a  ray  as  a  solution  x(t;[u])  such  that 

x(t;[u])c  dW(t)  for  all  t  e  [t  ,t,] 

a  d 

We  then  have  the  following  simple  but  fundamental  property:  if  an 

element  [u]  of  F  is  optimal  for  the  control  problem,  then  x(t;[u]) 

is  a  ray  of  the  propagation  problem.  The  proof  of  this  property  is 

given  in  a  previous  paper  [13]  and  may  be  summarized  as  follows:  if 

x(t  ;[u])  is  an  interior  point  of  W(t  )  for  some  t  e  [t  ,t  ] 

-L  1  a  d 

then  x(  t^;  [u] )  is  an  interior  point  of  W(t2)  for  all  1 2  €  [t-^t^] 

tl^ce  the  solutions  of  x  =  f(x,u(t),t)  at  the  time  t^  depend  con- 

^inuu^-y  on  the  initial  conditions  at  the  time  t^;  on  the  other  hand, 

if  [u j  is  upi-hnai  then  x(t^;[u])  is  a  boundary  point  of  W(t^) 

since  otherwise  Lher  would  be  another  [u]  e  F  with  (x(  t^;  [u]  ),t^)€  B 

and  xn(t^;[u])  >  xn(t^,  ,  contradicting  the  optimality  of  [u]  5 

hence  x(t;[u])  €  SW(t)  for  all  t  c  [t  ]. 

a  0 

The  optimal  control  problem  is  ther  reduced  to  the  study  of  the 
rays  of  the  abstract  propagation  problem.  We  may  generalize  to  an 
abstract  propagation  problem  the  Huyghens  Principle  and  the  associated 
Huyghens  construction.  The  basic  facts  about  such  a  propagation  may 
then  be  stated  as  follows:  if  a  wavefront  has  a  tangent  plane  at  a 
point,  then  the  wavelet  leading  to  this  point  is  entirely  located  on 
one  side  of  this  tangent  plane.  Consequently,  we  maximize  the  wave- 
front  velocity,  i.e.,  if  p  is  the  normal  to  the  wavefront  at  this 
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point  then,  along  a  ray  passing  through  this  point,  the  element  [u]  €  F 
is  such  that  the  scalar  product  of  p  and  f(x,u(t),t)  is  maximum. 

This  property  is  closely  related  to  Pontryagin1 s  Maximum  Principle. 

In  this  work  we  will  give  a  precise  analytical  formulation  of  this 
scheme.  To  verify  that  an  element  [v]  of  F  is  optimal,  we  adopt 
the  point  of  view  of  an  observer  riding  along  the  ray  x(t*[v])  and 
making  its  observation  in  a  moving  frame  of  coordinates  attached  to 
the  wavefront.  For  such  an  observer  all  the  missed  opportunities,  i.e., 
the  directions  he  could  have  followed  but  did  not,  are  leading  to  points 
on  one  side  of  a  hyperplane  passing  through  the  origin.  This  fact  can 
be  described  analytically  and  leads  to  the  mathematical  formulation  of 
the  Maximum  Principle  of  Pontryagin.  The  hyperplane  mentioned  above  is 
the  tangent  hyperplane  to  the  wavefront  whenever  such  tangent  hyperplane 
exists.  It  should  be  noted,  however,  that  our  derivation  does  not  re¬ 
quire  the  existence  of  such  a  tangent  plane  to  the  wavefront.  In  most 
intuitive  derivations  of  Pontryagin1 s  Maximum  Principle  the  existence 
of  the  tangent  plane  is  implicitly  assumed:  these  derivations  are  very 
unsatisfactory  since  the  real  strength  of  the  Maximum  Principle  of 
Pontryagin  lies  in  its  applicability  to  problems  where  this  assumption 
cannot  be  made. 

At  the  end  of  this  introduction  we  want  to  compare  the  geometries 
of  Finsler,  Riemann,  and  Euclid  with  the  geometry  induced  on  an  auton¬ 
omous  propagation  space  by  the  "wavelets"  w(x). 

Let  F(x,y)  be  a  real-valued  function  defined  on  X  X  X.  The 
function  F(x,y)  induces  in  the  space  X  a  geometry  for  which  the 
distance  ds  between  two  neighboring  points  x  and  x  +  dx  is  given 


by 
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ds  =  F(x,dx) 


This  geometry  is  called  a  Finsler  geometry  if 

(i)  F(x,ky)  =  kF(x,y)  for  every  k  >  0  and  all  (x,y)  e  X  X  X. 

(ii)  F(x,y)  >  0  if  y  h  0. 

(iii)  (x,y)  exists  and  is  positive  definite  for  all 

yy 

(x,y)  e  X  X  X. 

A  Riemannian  geometry  is  a  Finsler  geometry  such  that 

f^U,y)  =  X  gtk(x)  yV* 
i,k=l  1K 

and  a  Euclidean  geometry  is  a  Riemannian  geometry  where 


or  equivalently 

**(x,y)  =  l  (y1)2 

i=l 


To  a  Finsler  geometry  characterized  by  the  function  F(x,y)  on 
XXX  we  associate  the  set  valued  function  I(x)  on  X  defined  by 

I(x)  =  (y  :  F(x,y)  <  l) 

The  set  I(x)  is  called  the  indicatrix  at  the  point  x  of  the 
Finsler  geometry  on  X.  It  follows  from  the  conditions  (i)  to  (iii) 
that  for  a  Finsler  geometry  the  set  I(x)  has,  with  respect  to  the 
Euclidean  norm,  the  following  properties: 


12 


(a)  I(x)  is  closed  and  bounded. 

(P)  The  origin  is  an  interior  point  of  I(x). 

(7)  I(x)  is  strictly  convex  and  has  a  continuously  varying 
tangent  hyperplane  at  each  of  its  boundary  points.  In  particular,  I(x) 
is  an  ellipsoid  in  the  case  of  a  Riemannian  geometry  and  the  unit  sphere 
in  the  case  of  the  Euclidean  geometry. 

Conversely,  if  we  are  given  a  space  X  and  a  set  valued  function 
I(x)  defined  over  X  and  satisfying  the  conditions  (a),  (P)  and  (7) 
then  there  is  a  unique  function  F(x,y)  such  that 

I(x)  =  {y  ;  F(x,y)  <  1} 

Moreover,  this  function  F(x,y)  satisfies  the  conditions  (i),  (ii) 
and  (iii)  of  the  definition  of  a  Finsler  geometry. 

From  that  follows  that  a  Finsler  geometry  can  be  equivalently 
represented  by  the  function  F(x,y)  or  by  the  indicatrix  I(x).  We 
see  immediately  that  the  geodesics  of  the  Finsler  geometry  characterized 
by  the  indicatrix  I(x)  are  the  rays  of  the  abstract  autonomous  propa¬ 
gation  space  characterized  by  the  wavelets  w(x)  =  l(x). 

More  generally,  we  could  start  with  a  space  X,  an  arbitrary  set 
valued  function  I(x)  defined  on  X,  and  study  the  geometry  induced 
on  X  by  the  indicatrix  I(x),  i.e.,  the  geometry  for  which  the 
distance  ds  between  two  neighboring  points  x  and  x  +  dx  is  the 
smallest  nonnegative  real  number  a  such  that 
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In  that  case  the  rays  of  the  abstract  autonomous  propagation  space 
characterized  by  the  wavelets  v(x)  are  the  geodesics  of  the  geometry 
induced  on  X  by  the  indicatrix  l(x)  =  w(x). 

We  remark  that  the  class  of  wavelets  obtained  by  the  definition 

v(x)  =  (f(x,u)  :  u  e  n} 

in  the  case  of  an  abstract  autonomous  propagation  space  is  much  larger 
than  the  class  of  indicatrices  defined  by 

I(x)  =  (y  :  F(x,y)  <  1) 

in  the  case  of  a  Finsler  geometry. 

For  instance,  we  could  obtain  wavelets  for  which  the  origin  is  no 
more  an  interior  point,  which  are  not  closed,  which  are  not  strictly 
convex  or  even  with  a  lower  dimension  than  the  space  itself.  The 
geometry  obtained  by  taking  these  wavelets  as  indicatrices  can  have 
some  surprising  properties;  between  two  different  points  arbitrarily 
close  to  each  other  with  respect  to  the  Euclidean  norm,  we  could  have 
more  than  one  geodesics  or  even  no  geodesics  at  all.  Hence  the  geometry 
induced  on  the  space  X  by  the  wavelets  w(x)  is  much  more  general 
than  any  Finsler  geometry  defined  on  the  same  space  X. 
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A  GUIDE  TO  THE  READER 


In  Section  1  we  give  a  precise  statement  of  the  fundamental  prob¬ 
lem  of  optimal  control  mentioned  in  the  introduction.  In  Sections  2 
and  3  we  introduce  some  new  concepts,  perform  some  transformations, 
prove  a  few  propositions  in  order  to  be  able  to  state  precisely  the 
series  of  theorems  given  in  Section  4.  In  this  Section  4  are  assembled 
all  the  results  of  this  work:  the  necessary  condition  for  the  optimal 
control  of  a  nonlinear  uynamical  system.  In  Sections  5  to  9  we  estab¬ 
lish  some  intermediary  results  on  which  are  based  the  proofs  of  the 
theorems  of  Section  4.  These  proofs  are  given  in  Section  10. 

Although  this  work  is  entirely  devoted  to  the  theory  of  the 
general  nonlinear  dynamical  system  defined  in  Section  1,  we  shall 
make  frequent  references  to  the  theory  of  certain  linear  systems  intro¬ 
duced  at  the  end  of  Section  2.  We  do  it  for  the  following  reasons: 

(i)  Many  concepts  and  results  which  are  necessary  to  the  study 
of  nonlinear  systems  but  which  are  elaborate  and  difficult  when  dealing 
with  these  nonlinear  systems  become  particularly  clear  when  they  are 
applied  to  the  study  of  linear  systems . 

(ii)  The  methods  developed  here  for  nonlinear  systems  furnish 
a  very  simple  theory  for  the  linear  systems. 
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SECTION  1 


Statement  of  the  Problem 


In  this  section  we  shall  give  a  precise  formulation  to  the  funda¬ 
mental  problem  of  optimal  control  described  in  the  introduction. 

We  assume  that  we  are  given  the  following  elements: 

(i)  a  point 

A  =  (Vta)  =  (V  V  »  V  V  e  X  X  T  (1,1) 

where  X,  called  tne  state  space,  is  the  Euclidean  n-dimensional  space 

1  n 

with  elements  x  =  (x  ,  ,  x  )  and  where  T  is  the  real  line  with 

elements  t.  T  is  usually  interpreted  as  the  time  axis.  The  space 
X  X  T  is  called  the  event  space. 

(ii)  a  line  B  in  XX  T,  parallel  to  the  xn-axis  and  deter¬ 
mined  by  its  projections  x^,  i  *  1,  . . .  ,  n-1  and  t^  on  the  other 
axis.  More  precisely,  B  is  the  set 

{(x,t)  :  X1  =  x£  for  1  =  1,  ...»  n-1,  x11  e  R,  t  =  t^}  (1*2) 

where  R  is  the  real  line. 

(iii)  a  set 

flCU  (1.3) 

where  U,  called  the  control  space,  is  the  Euclidean  r-dimensional 

1  r 

space  with  elements  u  =  (u  ,  . . .  ,  u  ) 

(iv)  an  n-dimensional  vector  valued  function 

f(x,u,t)  =  (f^XjUjt),  f^(x,u,t),  ...  ,  fn(x,u,t))  (1.4) 
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(v)  the  class  F  of  all  bounded  measurable  [r-dimensional] 
vector  valued  functions  [u]  =  ((u(t),t)  :  t  e  [t^t^]}  satisfying 
the  condition 

u(t)  c  Q  for  all  t  e  [t  ,t,  ]  (1.5) 

SL  D 

Given  all  these  data  we  define  E  as  the  set  of  all  [n- dimensional] 

vector  valued  functions  [x]  =  {(x(t),t)  :  t  e  [t  ,t  ])  such  that 

a  0 

(1)  [x]  is  continuous  and  a.e.  differentiable  (1.6) 

(2)  x(ta)  =  xa  (1.7) 

(3)  there  exists  a  [v]  €  F  with  the  property 

x(t)  *  f(x(t),v(t),t)  a.e.  t  €  [ta,tb]  (1.8) 

(4)  there  exists  an  £>  0  such  that  f(x,u,t)  and  fx(x,u,t) 
are  defined,  measurable  with  respect  to  u  and  t,  uniformly  equi- 
continuous  with  respect  to  x,  and  uniformly  bounded  for  all 

( x, t,u)  €  N([x],e)  X  ft*  (1*9) 

where 

N([x],e)  =  {(x,t)  :  |x-x(t)|2+  |t-t|2<£2,  t  e  [t&,tb]}  (1.10) 

and  where  ft*  is  any  bounded  subset  ^f  ft.  - 

The  fundamental  problem  of  optimal  control  is  then  to  find  an 
element  [x]  in  E  such  that 

(a)  (x(tb),tb)  c  B  (1*11) 

( P)  for  any  [x]  in  E  such  that 
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(x(tb)>tb)  €  B 


(1.12) 


shall  hold  the  relation 

xn(tb)  <  x”^)  (1.13) 

The  problem  as  formulated  above  does  not  yet  exhibit  the  charac¬ 
teristic  structure  of  an  optimal  control  problem:  we  have  still  to 
introduce  the  strategy  space.  One  could  be  tempted  to  consider  as 
strategy  space  for  this  problem  the  totality  of  the  function  space  F 
introduced  earlier.  This  can  be  done  indeed  but  at  great  cost:  we 
must  make  strong  assumptions  on  the  function  f(x,u,t)  in  order  to 
insure  for  every  [u]  in  F  the  existence  and  uniqueness  of  the  solu¬ 
tion  to  the  differential  system 

x  =  f(x,u(t),t)  a.e.  t  g  [t  ,t  ]  (1.14) 

a  d 

with  the  initial  conditions 


x(ta)  =  xa  (1.15) 

In  this  work  we  prefer  to  avoid  making  any  further  assumption  on 
the  function  f(x,u,t).  Instead  we  shall  restrict  the  strategy  space 
to  an  appropriate  subset  F*  of  F  defined  as  the  set  of  all  [v]  e  F 
for  which  there  exists  an  [x]  €  E  with  the  property  (1.8). 

According  to  the  following  proposition,  the  set  F*  has  now  all 
the  properties  of  a  strategy  space  in  the  sense  of  the  introduction. 
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Proposition  1.1.  If  [v]  €  F*  then  there  is  a  unique  [x]  €  E  satis¬ 
fying  the  property  (1.8). 

Proof  of  Proposition  1.1.  By  definition  there  exists  at  least  one  such 

[x]  €  E;  let  us  assume  that  there  is  another  such  [x]  e  E  with 

[x]  h  [x].  Let  x  =  sup  (t  :  *x(0)  =  x(0)  for  G  <  t)  such  a  t 

t£[ta,tb] 

exists  since  x(t  )  =  x(t  )  =  x  .  Moreover,  x(t)  =  x(t)  since  [x] 
a  a  a 

and  [x]  are  continuous  and  t  /  t,  since  [3c]  ^  [x] .  Let  6  and 

D 

E  be  two  positive  real  numbers  corresponding  to  [x]  and  [3c ]  in  the 
definition  of  E.  Let  £*  *  min{  6,  "£ )  •  By  assumption  the  functions 
f(x,v(t),t)  and  fx(x,v(t),t)  are  then  bounded  and  measurable  with 
respect  to  t  in  the  £*  neighborhood  of  the  point  (x(t),t).  Hence, 
from  the  theory  of  ordinary  differential  equations,  there  is  a  B  >  0 
such  that  x(t)  =x(t)  for  t  €  [t,t  +  6].  This  contradicts  the 
definition  of  t  and  concludes  the  proof  of  Proposition  1.1. 

Proposition  1.1  allows  us  to  make  the  following  definition:  If 
[v]  €  F*,  let  [x( [ v ] ) ]  =  {(x(t; [v]),t)  :  t  €  [ t&, t^ ] }  be  the  unique 
element  in  E  with  the  property  (1.8). 

We  are  now  in  a  proper  position  to  state  the  problem  in  terms  of 
the  strategy  space  F*  as  follows: 

Find  an  element  [v]  €  F*  such  that 

(a)  (x(tb;[v]),t^)  e  B  (1.16) 

( P)  for  any  [w]  e  F*  with  the  property 

(x(tb;[w]),tb)  €  B  (1.17) 
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shall  hold  the  relation 


xn(tb;[v])  <  x^t^jtv])  (1.18) 

The  function  [v]  satisfying  the  conditions  (a)  and  (P)  shall  be 
called  an  optimal  control  function  and  the  corresponding  function 
[x( [ v] ) ]  shall  be  called  an  optimal  trajectory. 

Remarks  on  the  structure  of  the  function  f(x,u,t) 

We  should  mention  here  two  differences  between  the  statement  of 
the  fundamental  problem  in  optimal  control  given  above  and  the  funda¬ 
mental  problem  treated  by  Pontryagin  and  his  associates. 

In  our  formulation  we  allow  the  function  f(x,u,t)  to  be  depend¬ 
ent  of  the  variable  x11  to  be  maximized  at  time  t^.  We  are  allowing 
this  dependence  for  practical  and  esthetical  reasons:  to  make  the 
assumption  that  f(x,u,t)  is  independent  of  xn  would  lead  to  very 
little  simplification  of  the  subsequent  developments  but  would  never¬ 
theless  break  the  symmetry  among  the  state  variables .  Moreover,  many 
practical  problems  show  a  natural  dependence  of  the  differential 
equations  on  the  variable  to  be  maximized:  in  the  classical  problem 
of  the  maximization  of  the  payload  of  a  rocket,  the  evolution  of  the 
rocket  depends  on  its  mass  at  every  intermediate  instant  of  time. 

Also,  in  contradistinction  to  Pontryagin1 s  formulation,  we  do  not 
require  the  differential  system  to  be  time  independent. 

However,  by  an  appropriate  introduction  of  new  artificial  variables 
we  may  transform  our  problem  into  the  problem  treated  by  Pontryagin  and 
his  associates.  But  the  new  problem  obtained  by  this  introduction  of 
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artificial  variables  is  always  degenerate  in  the  following  sense;  any 
element  [u]  of  F*  satisfies  Pontryagin's  Maximum  Principle.  In 
order  to  obtain  nontrivial  necessary  conditions  for  an  element  [u]  of 
F*  to  be  optimal  we  must,  in  such  a  case,  use  the  so-called 
"transversality  conditions."  The  methods  developed  in  this  work  lead 
to  a  very  clear  geometric  interpretation  of  this  type  of  degeneracy 
which  will  be  discussed  in  detail  in  Section  4. 

Moreover,  because  of  the  assumption  on  the  continuous  dependence 
of  the  system  of  differential  equations  on  the  state  variables,  this 
transformation  cannot  be  done  if  the  time  dependence  of  the  differential 
equations  is  not  continuous;  this  is  to  be  contrasted  with  our  very 
weak  assumption  on  the  time  dependence  of  the  differential  equations; 
we  require  only  measurability  with  respect  to  time. 

We  want  to  make  another  remark  closely  related  to  the  introduction 
of  new  artificial  variables  and  on  the  necessity  to  consider  transver¬ 
sality  conditions  in  such  cases.  The  statement  of  the  problem  given 
here  is  made  up  of  two  parts;  we  define  a  control  system  with  initial 
conditions  by  (l.l),  (l.3)>  (1*4)  and  (1.5)  and  for  this  control  system 
with  initial  conditions  we  define  an  optimal  control  problem  by  (1.2), 
(1.11),  (1.12)  and  (1.13). 

Most  of  the  developments  made  in  the  following  sections  depend 
only  on  the  control  system  with  initial  conditions  but  not  on  the 
particular  optimal  control  problem.  In  fact,  as  we  shall  see  in 
Section  4,  our  results  are  directly  applicable  to  a  large  class  of 
optimal  control  problems;  this  will  allow  us  to  dispense  with  the 
formal  transformations  required  in  orc^er  to  apply  the  Maximum  Principle 
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of  Pontryagin  and  with  the  consideration  of  transversality  conditions 
which,  after  such  transformations,  are  strictly  needed  if  we  want  to 
obtain  a  nontrivial  set  of  necessary  conditions  for  an  optimal  solution. 


22 


SECTION  2 


Comoving  Coordinate  Space  along  a  Given  Trajectory 

In  the  introduction  we  wrote:  "To  verify  that  an  element  [v]  of 

F*  is  optimal  we  adopt  the  point  of  view  of  an  observer  riding  along 

the  trajectory  [x([v])]  and  making  his  observations  in  a  moving  frame 

of  coordinates  attached  to  the  wavefront.1*  In  this  section  we  intend 

to  carry  out  this  scheme:  for  an  arbitrary  element  [v]  of  F*  we 

shall  define  a  moving  frame  of  coordinates  Y([v])  by  an  appropriate 

transformation  from  X  X  to  Y((v])  X  [t  , t^l  and  for  each 

[u]  in  F*  we  shall  study  the  trajectory  [y([u],[v])]  = 

((y(tj [u], [v] ),t)  :  t  e  [t  ,t  ]}  which  is  the  transformation  in 

a  D 

Y([v])  X  [t  ,t  ]  of  the  trajectory  [x([u])]  in  X  X  [t  ,t_J.  The 
a  d  a  q 

space  Y([v])  X  [t^t^]  is  called  the  comoving  coordinate  space  along 
the  trajectory  [x([v])]. 

We  introduce  the  space  Y([v])  X  [t  ,t  ]  and  the  trajectories 

a  D 

[y([u],[v])]  in  that  space  for  the  following  reasons:  in  the  space 
Y( t v ] )  X  [ta,tfc]  there  is  a  very  natural  way  to  associate  with  every 
trajectory  [y([u],[v])]  an  approximate  trajectory  [y([u],[v])]  = 
((y(t;[u],[v]),t)  :  t  e  [t^t^]}  having  a  particularly  simple  structure. 
According  to  our  previous  analogy  the  trajectory  [y([u],[v])]  could  be 
considered  as  the  most  reasonable  approximation  of  the  trajectory 
[y( [u],[v])]  made  by  the  observer  riding  on  the  trajectory  [x([v])] 
and  knowing  the  function  f(x,u,t)  for  only  those  values  of  x  and 
t  which  are  in  the  neighborhood  of  his  own  trajectory. 
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The  consideration  of  these  various  types  of  trajectories  will  be 
of  great  help  to  derive  the  necessary  conditions  for  the  optimality  of 
the  trajectory  [x([v])]j  the  guiding  idea  of  this  derivation,  given  in 
Section  4,  could  be  summarized  as  follows:  assuming  that  [y([u],[v])] 
is  the  exact  expression  of  [y([u],[v])]  we  derive  easily  a  set  of 
necessary  conditions  for  the  optimality  of  [x([v])],  then  we  prove 
that  our  conclusions  are  still  valid  when  [y([u],[v])]  is  a  close 
enough  approximation  of  [y( [u],[v])] . 

After  these  commentaries  we  shall  now  proceed  with  the  precise 
definitions  of  the  entities  mentioned  above. 

For  any  [v]  €  F*  we  define  an  n  X  n  matrix  D(t;[v])  as 
follows : 


«t,W)  . 


x=x(t; [v]) 


* « t*.<V 


(2.1) 


More  precisely,  D( t ; [ v ] )  is  the  n  X  n  matrix  with  elements 
i,j  =1,2,  ...  ,  n;  defined  by 


Di  (  [v))  „ 

J  SxJ 


t  €  [t^]  (2.2) 


x=x(tj[v]) 


It  is  much  more  convenient  to  use  these  relations  in  the  form 
(2.1)  than  in  the  form  (2.2).  Such  a  convention  and  its  obvious  gen 
eralizations  will  be  used  throughout  this  work. 

From  our  assumptions  we  know  that  D(t;[v])  is  bounded  and 
measurable  over  . 


24 


Let  G(tj[v])  be  an  n  X  n  matrix,  continuous  with  respect  to  t, 


defined  over  [t  ,t  ],  satisfying  the  matrix  differential  equation 
a  d 

G(t;[v])=  -G(t;[v])  D(tj[v])  a.e.  t  e  [t^t^]  (2.3) 

and  such  that 


G(tb;[v])=  I  (2.4) 

where  I  is  the  identity  n  X  n  matrix. 

Proposition  2.1.  The  matrix  valued  function  G(t;[v])  exists,  is 
unique  and  bounded  over  [t  *1^]. 

Proposition  (2.1)  is  an  immediate  consequence  of  the  properties  of 

D(t;[v])  and  of  the  theory  of  ordinary  differential  equations. 

We  shall  now  introduce  a  Euclidean  n-dimensional  space  Y([v]) 

1  n 

with  elements  y  =  (y,...,y)  by  the  mapping 


4>(M)  :  XX  [t  ,t  ]  -»Y([v])  X  [t  ,t  ]  (2-5) 

a  d  a  d 


where 


(y,t)  =  4>(x,t;[v]) 


(2.6) 


is  defined  by 


y  =  G(tj[v])(x  -  x( t; [v] ) ) 


(2.7) 


Under  the  mapping  <t([v])  the  trajectory 
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[x(  [u])]  =  {(x(t;[u]),t)  :  t  e  [^,-t^]}  with  [u]  e  F*  (2.8) 
will  be  transformed  into  the  trajectory 

[y([u],[v])]  =  {(y(t;[u],[v]),t)  :  t  e  [ta,tb]}  (2.9) 

defined  by  the  relation 

y( t; [u] , [v] )  =  G(t;[v])(x(t;[u])-x(t;[v]))  for  all  te  [ t&, ] 

(2.10) 

Proposition  2.2.  For  every  [v]  and  [u]  in  F*  the  function 
[y( [u],[v])]  exists,  is  unique  and  continuous. 

Proposition  (2 .2) follows  directly  from  the  relation  (2.10)  since  we 
already  know  that  G( t; [v] ),x( t; [u] )  and  x(t;[v])  exist  separately, 
are  unique  and  continuous  over  [t  ,t  ]  for  [v]  and  [u]  €  F*. 

Let  us  now  define  the  approximate  trajectory 

[y([u],[v])]  =  ((y(t;[u],[v]),t)  :  t  e  ["ta, t^ ] }  (2.11) 

by  the  relation 

P  t 

y(t;[u],[v])  =  J  G(T;[v])(f(x(T;[v]),u(T),t)-f(x(Tj[v]),v(T),T))  dx 
t 

a 

for  all  t  e  [ta>t^]  (2.12) 

Proposition  2.3»  For  every  [v]  e  F*  and  every  [u]  €  F  the  function 
[y(  [u],[v])]  exists,  is  unique  and  continuous. 

Proposition (2. 5) follows  directly  from  the  definition  (2.12)  since 
we  already  know  that  G(t;[v]),f(x(t,[v]),u(t),t)  and  f(x(t;[v]),v(t),t) 
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are  measurable  and  bounded  over  [t  ,t,  ]  for  all  [v]  e  F*  and  all 

a  D 

[u]  €  F. 

As  we  mentioned  at  the  beginning  of  this  section,  the  trajectory 
[y( [u],[v])]  can  be  considered  as  an  approximation  of  the  trajectory 
[y(  [u], [v])] .  In  Section  7  we  shall  define  precisely  in  what  sense 
the  word  ,,approximation,,  should  be  understood  and  what  conclusions  we 
may  draw  from  it .  We  already  see  at  this  point  that 

y(t;[v],[v])  s  y(t;[v],[v])  =0  all  t  e  [b&, (2.13) 

The  proximity  of  [y([u],[v])]  and  [y([u],[v])]  is  particularly 
apparent  in  the  case  of  a  particular  class  of  linear  systems,  defined 
in  the  following  paragraph,  since  we  then  have 

[y([u],[v])]  =  [y([u],[v])]  for  all  [u]  and  [v]  €  F  (2.14) 

Application  to  a  Linear  System 

We  assume  here  that  f(x,u,t)  has  the  form 

f(x,u,t)  =  A(u,t)x  +  <p(u,t)  (2.15) 


or  the  form 

f(x,u,t)  =  A(t)x  +  Cp(u,t)  (2.16) 

When  the  function  f(x,u,t)  has  the  form  (2.15)  we  shall  speak  of  a 
linear  system  and  when  it  has  the  fonn  (2.16)  we  shall  speak  of  a 
linear*  system.  From  the  definition  it  follows  that  a  linear*  system 
is  a  particular  type  of  linear  system.  In  the  relations  (2.15)  and 
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(2.1 6)  the  expressions  A(u,t)  and  A(t)  are  n  X  n  matrices,  and 
qp(u,t)  is  an  n-dimensional  vector.  We  assume  that  A(u,t),  A(t)  and 
cp(u,t)  are  measurable  with  respect  to  their  arguments  and  uniformly 
bounded  over  ft*  X  [t  ,t.  ]  for  any  bounded  set  ft*  subset  of  ft. 

8.  D 

In  the  case  of  a  linear  system  we  have 

D(t;[v])  =  A(v(t),t)  for  all  t  e  [t  ,t  ]  (2.17) 

EL  D 

and  G(t;[v])  is  the  continuous  solution  of  the  matrix  differential 
equation 

G(t;[v])  =  -G(tj[v])  A(v(t),t)  a.e.  t  €  [ta,tfe]  (2.l8) 

with  the  terminal  condition 


C(tb;[v])  =  I  (2.19) 


Such  a  solution  is  usually  written  under  the  symbolic  form 


G(t;[v])  =  exp (/  D  A(v(t),t)  dr)  (2.20) 

t 


We  then  have 


y(t;[u],[v])  =  G(t;[v])(x(t;[u] )-x( t; [v] ) )  for  all  t  e  [ta,tfe]  (2.21) 


and 

y(t;(u],[v])  =  /  G(t;[v])(A(u(t),t)x(t;[v])-Hp(u(t),t) 

t 

a 

-  A( v( T ) , t)x( T j  [v] ) -cp( v( t) , T ) )  dx  (2.22) 
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for  all  t  e  [t  ,t.  ] 
a  d 


We  shall  now  compare  the  two  trajectories  [y([u],[v])]  and 
[y((u],[v])]. 

We  know  already  that 

y(ta;[u],[v])  =  y(ta;[u],[v])  (2.23) 

since  from  the  relation  (2.21)  we  have 

y(t  :[u],[v])  =  G(t  ;[v])(x(t  ;[u])-x(t  ;M)) 

&  a.  cl  cl 

-  G(t  ;[v])(x-  xj  -  0  (2.2U) 

a  a.  ti 

and  since  from  the  relation  (2.22)  we  have 

y(ta;[u],[v])  =  0  (2.25) 

Let  us  now  consider  y  *(t;  [u],[v])  and  y( t;  [u],  [v] ) .  We  have 
immediately 

y*(t{[u],[v])  =  G( t;  [v]  )(A(u( t),t)x(  tj  [v]  )-Kp(u( t),t) 

-A(v(t),t)x(tj[v])-<p(v(t),t))  dt 
for  a.e.  t  e  [ta,t.  ]  (2.26) 

and 
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.  V. 


y(t;[u],[v]) 

=  (G(t;[v])(x(t;[u])-x(tj[v])))‘ 

=  G(t;[v])(x(tj [u])-x(t; [v]))+G(t; [v])(x(tj[u])-x(tj[v])) 

=  -G(tj[v])A(v(t),t)(x(t;[u])-x(t; [v])) 

+G(t;[v])(A(u(t),t)x(t;  [u])-Kp(u(t),t)-A(v(t),t)x(t;[v])-<p(v(t),t)) 

=  G(t;[v])(A(u(t),t)x(t;[v])-Kp(u(t),t)-A(v(t),t)x(t;[v])-Cp(v(t),t)) 
+G(t;[v])(A(u(t),t)-A(v(t),t))(x(tj  [u])-x(t;[v])) 

*  G(tj[v]  )(A(u(t),t)x(t;  [v])-Kp(u(t),t)-A(v(t),t)x(t;  [v]  )-<p(v(t),t) ) 
+G(t;[v])(A(u(t),t)-A(v(t),t))G  1(t;  [v]  )y(tj  [u],  [v] ) 

a.e .  t  e  (2.27) 


The  relations  (2.26)  and  (2.27)  imply 
y(t;[u],(v])-y'(t;[u],[v]) 

=  G(t;[v])(A(u(t),t)-A(v(t),t))G  1(t;[v])y(t;[u],[v]) 

a.e.  t  €  (2.28) 

We  see  immediately  that  in  the  case  where 

A(u,t)  ■  A(v,t)  a.e.  t  €  [t&,t^]  (2.29) 

we  have 

y(t;[u],(v])»  y‘(tj[u],[v])  a.e.  t  €  [ ]  (2-30) 
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The  relations  (2.23)  and  (2.30)  then  implies 


y(t;[u],[v])  =  y(tj [u], [v] )  all  t  e  [ta,tb]  (2.31) 

We  can  then  state  the  following  result: 

Proposition  2.4*  For  a  linear*  system 

[y([u],[v])]  =  [y([u],[v])]  for  all  [u]  and  [v]  eF  (2.32) 

On  the  other  hand,  we  see  that,  even  for  a  linear  system, 

[y(  [u],[v])]  is  in  general  different  from  [y([u],[v])]  and  only  an 
approximation  of  [y([u],[v])]  in  a  sense  which  will  be  defined  in 
Section  7* 

The  identity  of  [y([u],[v])]  and  [y([u],[v])]  in  the  case  of 
a  linear*  system  is  particularly  helpful  to  obtain  quickly,  for  a 
linear*  system,  the  necessary  conditions  stated  in  Section  4  since 
Theorem  III,  the  most  difficult  theorem  of  Section  4  is,  as  we  shall 
see,  trivially  true  in  that  case. 
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SECTION  3 


Set  of  Reachable  Events 


In  this  section  ve  shall  introduce  the  important  concept  of  the 
set  of  reachable  events .  Given  a  control  system,  we  shall  say  that  a 
point  (xA,tft)  in  X  X  T  is  reachable  from  the  point  (x  ,t  )  in 

P  p  cx  u 

X  X  T  if  ta  >  t  and  if  there  exists  a  control  function  [u]  in 
p  —  a 

the  strategy  space  F*  such  that  the  solution  of  the  system 

{x  =*  f ( x,u( t ) , t )  a.e.  t  e  [t^tp]  (3-1) 

x(ta)  =  xa  (3.2) 

satisfies  the  terminal  condition 

x(tp)  =  Xp  (3-3) 

In  other  words,  following  the  terminology  used  in  the  introduction, 
we  say  that  (xQ,t0)  is  reachable  from  ( x_,, t  )  if  and  only  if 

p  p  u  u 

(ta,xa)R((u])(tg,Xg)  for  some  [u]  in  F*  (3-^) 

We  shall  consider  specially  the  set  H,  intersection  by  the  hyper¬ 
plane  t  »  t^  of  the  set  of  all  events  reachable  from  the  initial 
event  A  by  the  trajectories  [x([u])]  with  [u]  €  F*.  In  the  sub¬ 
sequent  analytical  developments  we  shall  also  use  the  set  H( [ v ] )  which 
is  the  intersection  by  the  hyperplane  t  *  t^  of  the  set  of  all  events 
reachable  from  the  initial  event  y  *  0  by  the  trajectories  [y([u],  [ v ])  ] 
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+ 

with  [u]  €  F*.  Similarly,  we  shall  consider  the  set  H([v])  which  is 
the  intersection  by  the  hyperplane  t  =  t^  of  the  set  of  all  events 
reachable  from  the  initial  event  y  *  0  by  the  approximate  trajectories 
[y(  [u],[v])]  with  [u]  e  F.  According  to  our  previous  analogy,  the 
set  H([v])  may  be  considered  as  the  most  reasonable  approximation  of 
the  set  H([v])  made  by  an  observer  riding  along  the  trajectory 
[x([v])]  but  knowing  the  function  f(x,u,t)  for  only  those  values  of 
x  and  t  which  are  in  the  neighborhood  of  his  own  trajectory. 

Formally,  we  then  have 

H  =  (x( j [u ] )  :  [u]  e  F*} 

H( [v] )  =  {y(tb;[u],[v])  :  [u]  e  F*}  for  any  [v]  e  F* 

H( [v] )  =  (y(tb;[u],[v])  :  [u]  e  F  }  for  any  [v]  e  F* 

We  immediately  have  the  relation 

H(  [v] )  =  (a-  x(tbj[v])  :  a  e  H} 

The  study  of  these  sets  and  particularly  of  their  boundaries  will 
be  made  in  the  next  section. 
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SECTION  k 


Necessary  Condition  for  the  Optimal  Control  of  a 
Dynamical  System 

In  this  section  we  give  a  series  of  seven  theorems.  These  theorems 
summarize  the  whole  content  of  this  work.  In  the  remaining  sections  we 
shall  he  concerned,  directly  or  indirectly,  with  the  proof  of  these 
theorems.  More  precisely,  we  shall  establish  in  Sections  5  to  9  some 
preliminary  results  which  will  be  used  in  Section  10  in  the  explicit 
proof  of  the  seven  theorems. 

Theorem  I .  If  an  element  [v]  of  F*  is  optimal  then  the  point 
x  =  x(t^;[v])  is  a  boundary  point  of  the  set  H. 

Theorem  II.  If  the  point  x  =  x(t^;[v])  is  a  boundary  point  of  the 
set  H  then  the  point  y  =  0  is  a  boundary  point  of  the  set  H([v]). 

Theorem  III.  If  the  point  y  =  0  is  a  boundary  point  of  the  set 
H([v])  then  y  =  0  is  a  boundary  point  of  the  set  H([v]). 

Theorem  IV.  If  the  point  y  =  0  is  a  boundary  point  of  the  set  H([v]) 
then  there  exists  a  nonzero  constant  vector  jt([v])  such  that  for  all 
[u]  in  F: 

U([v])|G(t;[v])(f(x(t;[v]),u(t),t)-f(x(tj[v]),v(t),t)))  <  0 

a.e.  t  c  [t  ,t,  ]  (4.1) 

a  d 
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By  <a|p>  we  mean  the  scalar  product  of  a  and  P. 


Theorem  V.  If  there  Is  a  nonzero  constant  vector  ff([v])  such  that 

the  condition  (4.1)  is  satisfied  for  all  [u]  in  F  then  there  is  a 

vector  p(t;[v])  continuous  and  nonidentically  zero  over  [t  ,t  ] 

a  d 

such  that: 

(i)  p(tj  [ v ] )  =  GT(t;[v])*(  [v])  all  t  € 

(ii)  for  all  [u]  in  F 

(p(t;[v]) |(f(x(t;[v]),u(t),t)-f(x(t;[v]),v(t),t)))  <  0 

a.e.  t  e  [ta,tb] 

(ill)  p(t;[v])  =  -  DT(t;[v])p(tj[v])  a.e.  t  e 

The  superscript  T  indicates  the  transposition  of  a  matrix. 

Theorem  VI.  If  the  point  x  *  x(t^;[v])  is  a  boundary  point  of  the 
set  H  then  there  exists  a  vector  p(t;[v]),  continuous  and  non¬ 
identically  zero  on  [t  ,t-  ],  such  that  the  conditions  (4.3)  and  (4.4) 

a  d 

are  satisfied. 

Theorem  VII .  If  an  element  [v]  of  F*  is  optimal  then  there  exists 
a  vector  p(t;[v]),  continuous  and  nonidentically  zero  on  [t  jt^]* 
such  that  the  conditions  (4.3)  and  (4.4)  are  satisfied. 

As  we  mentioned  earlier,  the  demonstrations  of  these  theorems  are 
given  in  Section  9*  The  demonstrations  of  Theorems  I,  II,  V,  VI  and 
VII  are  almost  immediate.  The  proofs  of  Theorems  III  and  TV  are  based 
on  the  results  established  in  Sections  5  to  10. 


(J*.2) 

(fc*3) 

(*.*) 
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If  we  define 


H(x,u,t,p)  =  (p|f(x,u,t)>  (4.5) 

then  Theorem  VII  could  be  equivalently  formulated  as  follows; 

If  an  element  [v]  of  F*  is  optimal  then  there  exists  a  vector 

p(t;[v]),  continuous  and  nonidentically  zero  on  [t  ,t  ]  such  that 

a  d 

P=p(t; [v ] ) 

a.e.  t  e  [ta,tb]  (4.6) 

i.e., 

x(tj[v])  =  f(x(t;[v]),v(t),t)  a.e.  t  e  [t&,tb]  (4.7) 

(ii)  p( t;  [v] )  =  -  v( t.L t_j l[xD  1 

x  x=x(t;[v]) 

a.e.  t  e  [tft, t^]  (4.8) 

(iii)  H(x(t;[v]),v(t),t,p(t;[v])  >  H(x( t; [v]),u(t),t,p(t; [v])) 

for  all  [u]  in  F  and  a.e.  t  €  [t&,t^]  (4.9) 

This  equivalent  formulation  of  Theorem  VII  is  the  well-known 
Maximum  Principle  of  Pontryagin. 

Let  us  make  some  comments  on  the  logical  structure  of  the  series 
of  theorems  given  earlier. 
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In  Theorem  I  we  associate  two  different  notions:  the  concept  of 
optimality  for  the  particular  optimal  control  problem  under  consideration 
and  a  topological  property  of  the  set  H,  which  set  depends  only  on  the 
given  control  system  with  initial  conditions  but  not  on  any  particular 
optimal  control  problem. 

In  Theorems  II  to  V  we  give  a  series  of  implications  concerning 
certain  properties  of  the  sets  H,  H([v])  and  H([v]). 

In  Theorem  VI  we  give  the  combined  result  of  all  the  implications 
contained  in  Theorems  II  to  V. 

In  Theorem  VII  we  use  Theorem  I  as  an  Intermediary  in  order  to 
obtain  from  the  topological  results  of  Theorem  VI  the  necessary  con¬ 
dition  for  an  optimal  solution  of  the  particular  optimal  control  problem 
under  consideration. 

Theorem  VI  is  the  most  important  result  in  the  theory  of  control 
systems.  This  theorem,  we  said  earlier,  depends  only  on  the  given 
control  system  with  initial  conditions  but  not  on  any  particular  optimal 
control  problem.  Hence,  when  a  particular  optimal  control  problem  is 
given,  we  need  only  to  verify  that  Theorem  I  is  valid  in  order  to 
derive  from  Theorem  VI  the  appropriate  necessary  conditions  for  an 
optimal  control.  The  verification  of  Theorem  I  is  particularly  simple 
in  the  case  of  the  fundamental  optimal  control  problem  considered  in 
this  work  but  could  also  be  easily  done  for  a  large  class  of  different 
optimal  control  problems. 

In  contradistinction  the  method  of  Pontryagin  and  his  associates 
is  the  following:  when  confronted  with  a  particular  optimal  control 
problem  they  introduce  new  artificial  variables  which  transform  the 
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control  system  itself,  and  hence  the  set  H  in  such  a  way  that,  for 
the  new  control  system,  the  particular  optimal  control  problem  has  the 
form  of  the  fundamental  optimal  control  problem.  Unfortunately,  the 
new  set  H,  obtained  after  introduction  of  these  artificial  variables 
has,  with  respect  to  the  new  event  space,  a  lower  dimensionality  than 
before.  In  this  case,  Pontryagin’s  Maximum  Principle,  in  the  form 
given  above,  can  be  trivially  satisfied  for  any  control,  not  necessarily 
optimal,  as  we  shall  show  in  a  later  paragraph.  For  such  case  Pontryagin 
and  his  associates  have  given  a  stronger  form  of  the  Maximum  Principle, 
including  some  auxiliary  conditions  similar  to  the  transversal ity  con¬ 
ditions  in  calculus  of  variations. 

Finally,  let  us  underline  how  closely  these  theorems  correspond  to 
the  intuitive  procedure  stated  in  the  introduction:  "To  verify  that  an 
element  [v]  of  F*  is  optimal  we  adopt  the  point  of  view  of  an  ob¬ 
server  riding  along  the  trajectory  [x([v])]  and  making  his  observations 
in  a  moving  frame  of  coordinates  attached  to  the  wavefront.  For  such  an 
observer  all  the  missed  opportunities,  i.e.,  the  directions  he  could  have 
followed  but  did  not,  are  leading  to  points  on  one  side  of  a  hyperplane 
passing  through  the  origin." 

By  Theorem  II  we  identify  the  moving  frame  of  coordinates  attached 
to  the  wavefront  with  the  comoving  coordinate  system  Y([v])  X  [£,1^] 
along  the  trajectory  [x([v])]. 

By  Theorem  III  we  show  that  for  our  purposes  the  set  H([v])  is 
as  good  as  the  set  H([v]).  In  our  analogy  the  set  H([v])  is  the 
most  reasonable  approximation  of  the  set  H([v])  made  by  the  observer 
riding  along  the  trajectory  [x([v])]  but  knowing  the  function  f(x,u,t) 
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for  only  those  values  of  x  and  t  which  are  in  the  neighborhood  of 
his  own  trajectory.  In  other  words,  Theorem  III  states  that  the  most 
reasonable  approximation  made  by  the  moving  observer  is  good  enough  as 
far  as  the  derivation  of  necessary  conditions  is  concerned. 

In  Theorem  IV  we  identify  the  vector  jt([v])  with  the  normal  to 
the  hyperplane  passing  through  the  origin  and  such  that  all  missed 
opportunities  are  directions  leading  to  points  located  on  one  of  its 
sides  only. 

Theorem  V  describes  the  same  property  as  Theorem  IV  but  from  the 
point  of  view  of  an  observer  fixed  in  the  space  X  X  T  instead  of  the 
moving  observer  considered  earlier. 

Remarks  on  the  dimensionality  of  the  set  H 

By  construction  the  set  H  is  a  subset  of  the  n-dimensional 
Euclidean  space  X.  In  Theorems  II  to  VI  we  have  derived  some  proper¬ 
ties  of  the  elements  [u]  €  F*  for  which  x(t^;[u])  is  a  boundary 
point  of  the  set  H.  In  other  words,  we  have  given  some  necessary  con¬ 
ditions  on  [u]  in  order  that  x(t;[u])  be  a  boundary  point  of  the 
set  H.  If  the  dimension  of  the  set  H  is  less  than  n  then  all  the 
previous  results  become  trivial  since  for  any  [u]  e  F*  the  point 
x(t^;[u])  will  be  a  boundary  point  of  the  set  H.  This  happens,  for 
instance,  when  the  set  H  is  a  subset  of  a  sufficiently  smooth  n-1 
dimensional  manifold  H*  nontangent  to  the  line  B1  projection  on  X 
of  the  line  B  in  X  X  T.  In  such  a  case  the  set  HPlBf  has  only 
isolated  points  and  therefore  all  [u]  €  F*  with  (x(  t^j  [u]  ),t^)  e  B 
are  locally  optimal.  This  explains  why  the  necessary  conditions  for 
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an  optimal  solution  can  be  trivially  satisfied  in  such  a  case  for  any 
control,  not  necessarily  optimal.  This  example  shows  the  need  of 
stronger  necessary  conditions.  This  need  is  partially  satisfied  by 
the  consideration  of  the  so-called  "transversality  conditions." 

Application  to  linear  systems 

We  saw  at  the  end  of  Section  2  that,  for  the  linear*  system  (2.16) 
we  have 

[y([u],[v])]  =  [y([u],[v])]  for  all  [u]  and  [v]  In  F  (4.10) 

In  particular,  this  implies 

fttjjfluMv])  =  yCt^jtuMv])  for  all  [u]  e  F  (4.11) 

1  •  €  *  , 

H(  [v])  =  H(  [v])  (4.12) 

The  relation  (4.12)  simplifies  greatly  the  derivation  of  the 
necessary  condition  for  the  optimal  control  of  a  linear*  system.  Indeed, 
Theorem  III  is  trivially  true  for  a  linear*  system  and  since  none  of  the 
other  theorems  are  particularly  difficult,  as  we  shall  see  in  Section  10, 
we  may  now  consider  the  entire  theory  for  the  optimal  control  of  a 
linear*  system  as  very  simple. 

In  the  case  of  the  linear  system  (2.15),  the  relation  (4.8)  takes 
the  simple  form: 


£(t;[v])  -  -  AT(v(t),t)  p(t;[v]) 
40 


(4.13) 


SECTION  5 


Norms  for  the  Space  of  Control  Functions  and  for  the 
Spaces  of  Trajectories 

In  this  section  we  define  various  norms  for  the  space  F  of  con¬ 
trol  functions  and  for  the  spaces  of  trajectories.  These  norms  will  he 
extensively  used  in  the  remaining  sections  of  this  work. 

Let  us  consider  an  arbitrary  collection  G  of  functions  from 
[t  ,t,  ]  to  a  Euclidean  space.  An  element  ((z(t),t)  :  t  e  [ta>t^]} 
in  that  collection  will  be  denoted  by  [z]. 

Let  us  define 

d( [ z ] , [ z ] )  =  ess  sup  |z(t)  -  z(t)|  (5*1) 

t  e 

and 

o([z],[z])  =  U((t  :  z(t)  b  z(t)  and  t  e  [ta,tb]})  (5-2) 

for  every  [z]  and  [z]  in  G. 

By  the  symbol  "ess  sup"  we  mean  the  essential  supreraum,  i.e., 

ess  sup  |  z(  t)  -  z(t)  |  =  inf  sup  |z(t)-z(t)|  (5-5) 

te[ta,tb]  as»0  tea 

where 

60  =  Uta,tb]~  B  :  u(B)  =  0}  (5-M 
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It  is  easy  to  prove  that  d( . , • )  and  o( . , . )  are  norms  for  a 
space  of  continuous  functions  and  semi -norms  for  a  space  of  measurable 
functions . 

In  this  work  we  shall  use  d(.,.)  over  the  spaces  of  trajectories, 
continuous  by  definition,  and  we  shall  use  d(.,»)  and  c(.,.)  over 
F,  the  space  of  control  functions,  measurable  by  definition. 

If  we  define  the  equivalence  relation  =  on  F  by 

[u]  =  [v]  iff  u(t)  =  v(t)  for  a.e.  t  e  [t^t^]  (5-5) 

then  d(  . , . )  and  o(  .,  .)  are  norms  for  the  quotient  space  T  ■  F/=  . 

In  order  to  simplify  the  notations  we  shall  talk  of  the  set  F 
even  where  we  should  talk  strictly  of  the  set  F  of  equivalence  classes 
of  F  under  =,  and  we  shall  simply  write 

[u]  =  [v]  (5.6) 

even  where  we  should  write  strictly 

[u]  =  [v]  (5-7) 

It  should  be  stressed  that  the  two  norms  a(  ., .)  and  d(  ., .) 
are  not  equivalent:  they  give  rise  to  two  completely  different  to¬ 
pologies  on  F. 

To  simplify  the  notation  d([z],0)  shall  be  written  d([z])  and 
similarly  c( [ z],0)  shall  be  written  c( [ z] ) . 


A  subset  ?  of  F  such  that  there  exists  a  k  with 


d(tu])  <k  for  all  [u]  €  ?  (5*8) 

is  called  a  d-bounded  subset  of  F. 

In  particular,  we  shall  denote  by  F^  the  set  of  all  elements  in 
F  such  that  d([u])  <  k,  i.e., 

Fk  *  {[u]  :  [u]  e  F,  d([u])  <  k}  (5-9) 

Similarly,  F*  will  be  the  set 

F£  •  ([u]  :  [u]  e  F*,  d([u])  <  k}  (5.10) 
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SECTION  6 


Curvilinear  Coordinate  Space  along  a  Given  Trajectory 

In  Section  2  we  have  associated  to  every  element  [v]  of  the 

function  space  F*  a  space  Y([v])  X  [t  ,t  ]  called  the  comoving  co- 

a  d 

ordinate  space  along  the  trajectory  [x([v])].  In  this  section  we 
shall  associate  to  every  element  [v]  of  the  function  space  F*  an¬ 
other  space  denoted  by  Z([v])  X  [t  ,t  ]  and  called  the  curvilinear 

a  d 

coordinate  space  along  the  trajectory  [x([v])].  The  consideration  of 

the  spaces  Z([v])  X  [t  ,t  ]  is  a  very  convenient  tool  in  the  study 

a  d 

of  the  existence  and  uniform  convergence  of  trajectories  corresponding 

to  control  functions  in  a  d-bounded  subset  of  F  in  terms  of  the  norm 

o  defined  in  Section  5  over  the  space  F  of  control  functions. 

In  the  case  of  the  linear  system  introduced  in  the  last  paragraph 

of  Section  2,  we  shall  prove  that  Z([v])  =  Y([v])  and  we  shall  show 

that  for  a  linear  system  the  results  given  in  this  section  take  a  much 

simpler  form.  Moreover,  if  the  system  is  a  linear*  system  we  proved 

already  in  Section  2  that  we  have  the  relation  [y([u],[v])]  = 

[y( [u], [v]) ] .  This  will  enable  us  to  show  that  for  a  linear*  system 

the  results  given  in  this  section  could  be  easily  derived  from  the 

classical  theory  of  nonhomo geneous  linear  differential  equations.  In 

the  last  paragraph  of  this  section  we  shall  consider  briefly  the  case 

of  linear  and  linear*  systems. 

If  [w]  e  F*  and  (x,t)  e  X  X  [t  ,t  ]  let 

a  d 

[x([v],x,t)]  =  ((x(tj[v],x,t),t)  :  t  €  [t^J)  (6.1) 


be  a  continuous  vector-valued  solution  of  the  differential  equation 


x(tj[w],x,t)  =  f(x(tj[w],x,t),w(t),t)  a.e.  t  e  (6.2) 

satisfying  the  initial  condition 

x(t;[v],x,t)  =  x(t;[w])+x  (6.3) 

For  every  [v]  €  F*  we  shall  now  introduce  a  Euclidean 

1  n 

n-dimensional  space  Z([w])  with  elements  z  =  (z  ,  ,  z  )  by 

the  mapping 

¥([w])  :  XX  [ta,tbl  -*Z([w])  X  (ta, ]  (6.4) 

for  which 

(z,t)  =  ¥(x,t;[v])  (6.5) 

is  determined  by  the  relation 

z  =  x(tb;[w],x-x(tj[w]),t)  -  x(tb;[w])  (6.6) 

or  equivalently  the  inverse  mapping 

y“1(  [w])  :  Z([v])  x  ->XX  (6.7) 


for  which 


(x,t)  «  «_1(z,tj[w]) 

^5 


(6.8) 


is  determined  by  the  relation 


X  =  x(t}  [w],z,tb)  (6.9) 

In  other  words,  the  mapping  ¥([w])  associates  to  an  element 
( x, t )  of  X  X  [ta,tb]  an  element  (z,t)  in  Z([w])x  [t^t^]  deter¬ 
mined  as  follows:  z  is  the  difference  of  the  projections  on  the 
terminal  hyperplane,  i.e.,  the  hyperplane  X  X  ( t^} ,  of  the  points 
(x,t)  and  (x(t;[w]),t)  of  the  hyperplane  X  X  (t)  where  the  pro¬ 
jecting  lines  are  the  solutions  of  the  differential  system  with  the 
control  function  [w] .  Conversely,  the  inverse  mapping  'i  1([w]) 
associates  to  an  element  (z,t)  in  Z([w])  X  [ta,tfe]  an  element 

(x,t)  in  XX  [t  ,t  ]  determined  as  follows:  x  is  the  projection 
a  d 

on  the  hyperplane  X  X  {t}  of  tho  point  (x(t^;[v))  +  0 *  ^e 

terminal  hyperplane  XX  (t^) .  The  projecting  lines  are  as  before  the 
solutions  of  the  differential  system  with  the  control  function  [w] . 

In  the  following  propositions  we  shall  prove  some  results  con¬ 
cerning  the  existence,  uniqueness  and  boundedness  of  the  mappings 
Y([w])  and  their  inverses  f  1([w]). 

Definition:  If  [v]  €  F*,  k  is  a  positive  number  and  l  is  the 
positive  number  associated  to  [v]  in  the  definition  of  E,  let  us 
define  F*([v],k)  to  be  the  set 

[tv]  :  [v]  €  F*,  d([w],[v])  <  k,  d([x([w])],[x([v])])  <  §}  (6.10) 
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It  should  be  remarked  that  we  do  not  know  at  this  point  if  the  set 
F*([v],k)  contains  any  other  elements  besides  [v].  In  Proposition  (6.10) 
we  shall  exhibit  a  large  class  of  elements  in  F  which  also  belong  to 
F*([v],k)  and  a  fortiori  to  F*.  The  set  F*([v],k)  plays  a  very  im¬ 
portant  role  in  this  work:  the  necessary  condition  for  the  optimality 
of  the  element  [v]  of  F*  will  be  derived  from  the  comparison  of  [v] 
with  only  those  elements  in  F*  which  also  belong  to  F*(  [v],k) . 

Proposition  6.1.  If  [w]  e  F*([v],k)  and  £  is  the  positive  number 
associated  to  [v]  in  the  definition  of  E  then  -g  can  be  used  as 
the  positive  number  associated  to  [w]  in  the  definition  of  E. 

Proof  of  Proposition  6.1.  From  the  definition  of  E  we  know  that 
f(x,u,t)  and  fx(x,u,t)  satisfies  certain  conditions  for  all 
(x,t,u)  e  N( [x( tv])],  &)  X  fl*  where  fl*  is  any  bounded  subset  of  fl. 

But  d(  [x(  [w])],[x(  [v])]  <  g-  implies  H([x(  [w])],|)CN([x([v])],£) 
hence  f(x,u,t)  and  fx(x,u,t)  will  a  fortiori  satisfy  the  same  con¬ 
ditions  for  all  (x,t,u)  e  N( [( [w] ) ], | )  X  n*.  This  concludes  the 
proof  of  Proposition  (6.1). 

Proposition  6.2.  If  [v]  e  F*,  £  is  the  positive  number  associated 

to  [v]  in  the  definition  of  E  and  k  is  an  arbitrary  positive 
number,  then  there  exists  an  M  >  0  such  that  for  all  |x|  <  , 

all  t  e  [t  ,t.  ]  and  all  [w]  e  F*([v],k)  we  have 

cl  D 

(i)  [x((w],x,t)l  and  j  exist  and  are  unique  (6.11) 

(ii)  d([x([w],x,t)],[x([w])])  <  |x|M  (6.12) 
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Proof  of  Proposition  6 .2 .  From  our  assumption  we  know  that 


U)  [x([w])]  exists  (6.13) 

(ii)  f(x,w(t),t)  and  ^  exist  and  are 

uniformly  bounded  for  all  (x,t)  e  N( [x( [w] ) ] , |>)  (6. it) 

By  definition  [x([w])]  and  [x([w],x,t)]  are  solutions  of  the 
same  differential  equation 


x  =  f(x,w(t),t)  a.e.  t  e  [t  ,t  ]  (6.15) 

a  o 


but  corresponding  to  different  initial  values.  Hence  from  the  theorem 
on  the  continuous  dependence  of  the  solution  of  a  differential  equation, 
we  know  that  there  exists  a  constant  M([w])  such  that  for  all 

c 

| x |  <  2M(  [w]  )  ?  e  [t^t^]  the  functions  [x([w],x,t)]  and 

I  exist  and  are  unique.  It  remains  to  show  that  we  can 

bx  J 

find  a  constant  M  such  that  M([w])  <M  for  all  [w]  e  F*([v],k). 

Let  j  |  denote  the  norm  of  the  matrix  — . 

This  norm  is  defined  as  usual  by  the  relation 


df(x.u.t) 

dx 


sup 

M<1 


df(x.u.t) 
dx  y 


(6.16) 


Let 


R 


sup 


df(x.u.t) 

dx 


(6.17) 
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over  all  values  of  (x,t)  €  N( [x( [v] ) ],  t)  and  all  values  of  u  with 
|u|  <  d([v])  +  k. 

Let 

M  =  2  e  b  (6.18) 

We  shall  now  show  that  this  constant  M  satisfies  our  require¬ 
ments  . 

Let  suPremum  all  closed  intervals 

[®l,e2^  ^a#tb^  such  that 

(i)  x(t;[w],x,t)  exists  and  is  unique  on  (6.19) 

(ii)  |x(t;[w],x,t)-x(tj[w]) |  <  |x|M  for  all  t  c  [d1>92^  (^-20) 

The  previous  definition  makes  sense  since  the  class  of  closed  in¬ 
tervals  satisfying  the  conditions  (6.19)  and  (6.20)  is  not  empty:  it 
contains  the  closed  interval  [t,t] . 

We  then  have 

|x(tj[w],x,t)-x(t;[w])  I  <  R|x(t;[w],x,t)-x(tj[w])| 

for  a.e.  t  €  [T^,Tg]  (6.21) 

and 

|x(t;[v],x,t)-x(t;[w])|  =x  (6.22) 

This  implies 

R(t -t  )  „ 

|x(t}[v),x,t)-x(t;[v])|  <  |x|  e  a  =  |x|  j  (6.23) 
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Hence  *  [t&,t^]  because  otherwise  there  would  be  a  closed 

interval  [0^,6*]  with  fT1#T2^  Ita,tb^  for  which  the  con¬ 

ditions  (6.19)  and  (6.10)  would  be  satisfied,  contradicting  the  definition 

of  [T^Tg]. 

The  relation  (6.23)  now  becomes 

d([x([w],x,t)],[x([w])])  <  |x|  |  (6.24) 

This  concludes  the  proof  of  Proposition  (6.2). 

Proposition  (6.2)  leads  immediately  to  the  following  result: 

Proposition  6.3*  If  [v]  €  F*,  6  is  the  positive  number  associated 

to  [v]  in  the  definition  of  E,  k  is  an  arbitrary  positive  number, 

and  M  is  the  positive  constant  introduced  in  Proposition  (6.2),  then 

for  every  [w]  e  F*([v],k)  the  mapping  Y([w])  :  X  X  [t  ,t  ]-»Z([w])  X 

a  d 

[ t&^ t^ ]  described  by  the  relation  (6.6)  is  well  defined  for  all  (x,t) 

in  XX  [ta,t^]  such  that  |x-x(t;(w])|  <  ^  and  the  resulting  (z,t) 

in  Z([w])  X  [t  ,t  ]  is  such  that  |z|  <  M  |  x  -  x(t;  [w] )  |  .  Conversely, 

the  mapping  ¥  ^([w])  :  Z([w])  X  [t  ,t  ]  -»XX  [t  ,t,  ]  described  by 

a  d  a  0 

the  relation  (6.9)  is  well  defined  for  all  (z,t)  in  Z([w])  X  [t^t^] 
such  that  |z|  <  7^  and  the  resulting  (x,t)  in  X  X  [t&,t^]  is 
such  that  | x  -  x(t; [w] ) |  <  M  | z|  . 

Convention  1.  We  shall  write  x(t;[w],x)  for  x( t; [w],x,t^) 

2.  In  the  remaining  part  of  this  work  [v]  will  always  be 
an  element  in  F*,  6  will  be  the  positive  number  associated  to  [v] 

in  the  definition  of  E,  k  will  be  an  arbitrary  positive  number  and 
M  will  be  the  positive  number  introduced  Proposition  (6.2). 
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Definition.  For  [w]  e  F*([v],k)  and  |z|  <  let 


D(t;[w],z) 


c)f(x,w(t),t) 

ox 


x=x(tj [w],z) 


for  all  t  e  [t  ,t,  ]  (6.25) 

8.  D 


Proposition  6.4.  The  matrix  valued  function  D(t;[w],z)  is  measurable 
with  respect  to  t  and  uniformly  bounded  for  all  t  €  [t&,t^],  all 
[w]  e  F*([v],k)  and  all  |z|  < 

Proof  of  Proposition  6.4.  From  [w]  e  F*([v],k)  and  |z|  < 
we  know  that 


(i) 

(x(t;[w],z),t)  e  N([x([v])],£) 

for  all 

t  €  (S^e) 

(ii) 

|v(t) |  <  d( [ v] )  +  k 

for  all 

t  e  [ta,tb]  (6.27) 

|D(t;[w],z) |  <  R 

for  all 

t  6 

where  N  is  the  positive  constant  introduced  in  the  proof  of 
Proposition  (6.2).  This  concludes  the  proof  of  Proposition  (6.4). 

Definition.  For  [w]  e  F*([v],k),  |z|  <  ^  let  G(t;[w],z)  be  the 
continuous  solution  of  the  matrix  differential  equation 

G(t;[w],z)  =  -G(t;[w],z)  D(t;[w],z)  a.e.  t  e  [t^t^]  (6.29) 

with  the  terminal  condition 


G(tbj[w],z)  =  I 


(6.30) 
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where  I  is  the  identity  matrix. 


Proposition  6.5 .  The  matrix  G(t;[w],z)  exists,  is  uniformly  contin¬ 
uous  with  respect  to  t  and  uniformly  bounded  for  all  t  c  [t  ,t  ], 

a  d 

all  [w]  e  F*([v],k)  and  all  |z|  <  . 

Proof  of  Proposition  6.5»  Immediate  from  Proposition  (6.4)  and  the 
theory  of  linear  differential  equations  with  bounded  measurable  coef¬ 
ficients  .  Moreover, 

N( t  -t  ) 

|  G(  t ;  [w],z)  |  <  e  a  <M  (6.31) 

for  all  t  c  [t  ,t^],  all  [w]  e  F*([v],k)  and  all  |z|  <  ,  where 

M  is  the  positive  constant  introduced  in  the  proof  of  Proposition  (6.2). 
This  concludes  the  proof  of  Proposition  (6.5)* 

Proposition  6.6.  The  matrix  G  ^*(t;[w],z),  inverse  of  the  matrix 
G(t;[w],z),  exists,  is  uniformly  continuous  with  respect  to  t  and 
uniformly  bounded  for  all  t  c  [tft,t^],  all  [w]  e  F*([v],k)  and  all 


Proof  of  Proposition  6.6.  Let  G*(t;[w],z)  be  the  continuous  solution 
of  the  matrix  differential  equation 


G*(t;[w],z)  =  Dftjtv^z)  G*(t;[w],z)  a.e.  t  e 


(6.32) 


with  the  terminal  condition 


=  I 
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For  the  same  reasons  as  in  Proposition  ( 6 . 5 ) ,  we  know  that  the  matrix 
G*(t;[w],z)  exists,  is  uniformly  continuous  with  respect  to  t  and 
uniformly  bounded  for  all  t  e  [ta,t^],  all  [w]  e  F*([v],k)  and  all 
I z I  <  •  We  will  now  prove  that 

G(t;[w],z)  G*(  t;  [w] ,  z)  =  I  for  all  t  €  [t^]  (6. 3*0 

i.e., 

G*(tj[w],z)  *  G_1(t;[w],z)  for  all  t  e  [t^]  (6-35) 

We  already  know  that 

G(tb;[w],z)  G*(tb;[w],z)  =  II  =  I  (6.36) 

and  that  G(t;[w],z)  G*(tj[w],z)  is  continuous  over  [t&,tb) .  It 
remains  to  verify  that 

(G(t;[w],z)  G*(t;[w],z)) ’  =  0  a.e.  t  e  [ta,tb]  (6-37) 

This  last  relation  is  immediate  since 
G(t;[w],z)  G*(t;[w],z)+G(t;[w],z)  G*(t;[w],z) 

=  -G(t;[w],z)D(tj[w],z)G*(t;[w],z)+G(tj[w],z)D(tj[w],z)G*(t;[w],z) 

=  0  a.e.  t  e  [ta,tb]  (6.38) 

This  concludes  the  proof  of  Proposition  (6.6). 
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Proposition  6.7.  For  all  t  e  [t  ,t.  ],  all  [w]  €  F*([v],k) 

a  d 

I  z  I  —  m  We  ^laVe 

o'Ht, (»],«>  - 

Proof  of  Proposition  6. 7*  Let 

A(t)  =  o-1(t,[w],z)  -MSjLsLsl 

we  have 

A(t^)  =1-1  =  0 

and 

A(  t)  =  D(t;[w],z)  G  1(t;  [w],z) f(x(  t;  [w] ,  z),w(  t),t) 
=  D(tj[v],z)  G  1(t;[w],z)  -D(t;[w],z) 

=  D(tj[w],z)  A(t)  a.e.  t  e 

From  the  relations  (6.4l)  and  (6A2),  we  obtain 

A(t)  =0  all  t  €  [t^tj 

This  concludes  the  proof  of  Proposition  (6.7)* 

Under  the  mapping 

*([v])  :  xx  -*  Z(  [w] )  X  [ta,tb] 


and  all 


(6.39) 


(6.40) 


(6.4l) 


(6.42) 


(6.43) 


(6.44) 
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the  trajectory 


[x([u])]  =  ((x(tj[u]),t)  :  t  €  [ta,tb]}  with  [u]  €  F*  (6.45) 
will  he  transformed  into  the  trajectory 

(z( [u],[w])]  =  {( z(tj [u], [v] ),t)  :  t  e  [t  ,t,  ])  (6.46) 

a  o 

according  to  the  relation 

z(tj[u],[w])  =  x(tb;[w],x(t;[u])  -  x(t;(w]),t)  -  x(tb;[w]) 

for  all  t  e  [t  ,t.  ]  (6.47) 

a  d 


Conversely,  under  the  mapping 

Y  1([w])  :  Z( [w] )  x  [ta,tfe]  — » X  x  [t&,tb]  (6.48) 

the  trajectory  [z([u],[w])]  will  be  transformed  into  the  trajectory 
[x([u])]  according  to  the  relation 

x(t;[u])  =  x(t;[v],z(t;[u],[v]))  all  t  c  [t  ,t.]  (6.49) 

a  d 

In  other  words,  the  mapping  ¥([w])  associates  to  a  trajectory 
tx([u])]  in  XX  [t&,tb]  a  trajectory  [z([u],[w])]  in  Z([w])x 
^a^S)^  determined  as  follows:  z(t;[u],[w])  is  the  difference  of  the 
projections  on  the  terminal  hyperplane  X  X  (th)  of  the  points 
(x(tj[u]),t)  and  (x(t;[w]),t)  of  the  hyperplane  X  X  (t)  where  the 
projecting  lines  are  the  solutions  of  the  differential  system  with  the 
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control  function  [w] .  Conversely,  the  Inverse  mapping  f”^([w])  asso¬ 
ciates  to  a  trajectory  [z([u],[w])]  In  Z([v])  x  [t^t^]  a  trajectory 
[x( [u])]  in  X  X  [t  ,t^]  determined  as  follows:  x(tj[u])  is  the 
projection  on  the  hyperplane  X  X  (t)  of  the  point  ( x( t^; [w] )  + 

^(t^; [u] ),t^)  of  the  terminal  hyperplane  X  X  (t^) .  The  projecting 
lines  are  as  before  the  solutions  of  the  differential  system  with  the 
control  function  [w] . 

Proposition  6.8.  If  [u]  and  [w]  c  F*([v],k)  then 

(i)  z(tj[u],[w]) 

=  G(t}[v],z(t;[u],(v]))(f(x(tj[u]),u(t),t)-f(x(t;[u]),w(t),t)) 

for  a.e.  t  e  [ta,t^]  (6.50) 

(ii)  z(t&; [u],[v])  =0  (6.51) 

Proof  of  Proposition  6.8.  If  [u]  and  [w]  e  F#([v],k)  then 
[z([u],[w])]  exists  and  we  have  the  relation 

x(t;[u])  =  x(tj[w],z(t;[u],[w]))  for  all  te  (^>^3  (6-52) 

By  differentiation  of  (6.52)  with  respect  to  t  we  obtain 

f(x(tj[u]),u(t),t) 

*  f(x(t;[u)),w(t),t)+G"1(t;[w],z(t;[u],[v]))z(t;(u],[w]) 

for  a.e.  t  e  [-ba, t^]  (6.53) 
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since 


MliMal)  ■  o'Vjlwhz)  (6.5M 

as  was  proved  in  Proposition  (6.7)*  From  the  relation  (6.53)#  we  obtain 
the  relation  (6. 50). 

If  we  let  t  =  t  in  relation  (6.47),  we  have 
a 

z(ta;[u], [v] )  =  x(ttj[w],0,ta)  -xtt^jtw]) 

=  x(tb;M)  -  xCt^jCw])  =  0  (6.55) 

This  concludes  the  proof  of  Proposition  (6.8). 

Proposition  6.9.  There  are  positive  constants  P  and  Q  such  that 
for  all  [w^]  €  F*([v],k)  and  all  [w^l  €  F  with 

(i)  d(  [w^jwg])  <  k  (6.56) 

(ii)  a([w1],(w2])  <  Q  (6.57) 

we  have 

U)  [w2l  e  F*  (6.58) 

(ii)  d([x([w1])],[x([wg])])  <  ^  (6.59) 

(iii)  d([x([w1))Mx([w2))])  <  Po([w1],[w2])  (6.60) 

Proof  of  Proposition  6.9.  Let 

L  *  sup  |f(x,u,t)|  (6.6l) 
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over  all  (x,t)  €  N(  [x(  [v] )  ],  6),  all  |uj  <  d([v])  +  2k.  Let  M  be 
the  positive  constant  introduced  in  Proposition  (6.2).  We  shall  prove 
that  the  relations  (6.58),  (6.59)  and  (6.60)  are  valid  when  the  constant 
P  and  Q  are  determined  by 


P  =  2  M2L 

(6.62) 

Q  =  % 

(6.63) 

x  be 

the  supreimim  of  all  times  6  e  [tft,t^]  such  that 

(D 

x(t;[w  ])  exists  and  is  unique  on  [t  ,9] 
d  a 

(6.64) 

(ii) 

|x(t;[v2])  -x(t;[w1])|  <  for  all  t  e  [ta,e] 

(6.65) 

(iii) 

z(t;[w0],[w1  ])  exists  and  is  unique  on  [t  ,9] 
d  ±  a 

(6.66) 

(iv) 

IzttjtWg],^])!  <  for  all  t  €  [ta,0] 

(6.67) 

This  definition  makes  sense  since  the  set  of  all  times  9  such  that 

the  relations  (6.64)  to  (6.67)  are  satisfied  on  [t  ,0]  is  not  empty: 

it  contains  the  time  t  . 

a 

Fran  (6.65)  and  (6.6l)  we  have 

|f(x(tj[w2]),w2(t),t)-f(x(t;[w2]),w1(t),t)|  <2L 

for  all  t  e  [t  ,t]  (6.68) 

a 

From  the  relation  (6.67)  and  Proposition  (6. 5)>  we  have 

|G(t;[w1],z(t;[v2],[v1]))|  <  M  for  all  tetter]  (6.69) 
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By  definition,  we  have 


MX  (t:f(x(tj  (Wg]),Wg(t),t)  -  f(x(t;[Wg]),w1(t),t)  k  0,  tetter]}) 

<o(t  w^jwg])  (6.70) 

From  the  relations  (6.68),  (6.69)  and  (6.70),  and  from  the  differ¬ 
ential  equation  (6.50)  with  initial  condition  (6.51),  we  obtain 


!z(t;(w2],[w1])|  <2LMo([w1],[w2])  for  all  t  e  [t&,T]  (6.71) 

With  the  help  of  relations  (6.57)  and  (6.63),  the  inequality 
(6.71)  becomes 


U(t;[w  ],[w  ])|  <  for  all  t  c  [t  ,t]  (6. 72) 

4  M  a 

Applying  Proposition  (6.3)  to  relations  (6. 71)  and  (6. 72),  we 
obtain  the  two  relations: 

|x(t;[Wg])  -x(t;[w1])|  <  2LM2a(  [wj,  [Wg] ) 

for  all  t  e  [t  ,t]  (6.73) 

a 

|x(tj[Wg))  -  x(t;[w1])|  <  for  all  te  [ta,t]  (6.7k) 

Frcm  (6.72)  and  ( 6 -7^)  we  conclude  that 

T  -  \  (6.75) 
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■because  otherwise  there  would  be  a  0*  e  (T,tfc]  for  which  all  the  con¬ 
ditions  (6.64)  to  (6.67)  would  be  satisfied,  in  contradiction  with  the 
definition  of  t.  Hence  , 

[Wg]  e  F*  (6.76) 

and  the  expression  (6.74)  can  now  be  written 

|x(t;[w2])-  x(tj[w1])|  <  for  all  te  [ta,tt]  (6.77) 

i.e., 

d([x([w2l)],[x([w1])])  <  (6.78) 

and  a  fortiori 

d([x([w2])],[x([w1])])  <  |  (6.79) 

From  relations  (6.62),  (6.73)  and  (6.75),  we  have 

|x(t;[wg])  -  x(t;[w1))|  <  Pa  ( [w^,  [wg] )  for  all  t  e  [ta,tfe]  (6.80) 

i  •  6  •  f 


d([x([w2])],[x([w1])])  <  Fa  ( [wL], [w2])  (6.8l) 

Relations  (6.76),  (6.79)  and  (6.8l)  are  the  required  relations  (6.58), 
(6.59)  and  (6.60).  This  concludes  the  proof  of  Proposition  (6.9)* 
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Proposition  6*10.  If  Q  is  the  positive  constant  introduced  in 
Proposition  (6*9)>  then  for  all  [u]  €  F  with  d([u],[v])  <  k  and 
cr([u],[v])  <  Q,  we  have  [u]  e  F*(  [v],k) . 

Proof  of  Proposition  6*10.  By  applying  Proposition  (6.9)  to 
[w^]  =  [v]  and  [Wg]  -  [u],  we  obtain 

(i)  [u]  e  F*  (6.82) 

(ii)  d([x([v])],[x([u])])  <  |  (6.83) 

From  relations  (6.82)  and  (6.83)  and  from  the  assumption  d([u],[v])  <  k, 
we  then  have 

[u]  €  F*([v],k)  (6.84) 

This  concludes  the  proof  of  Proposition  (6.10). 

Proposition  6.11.  If  Q  and  P  are  the  positive  constants  introduced 
in  Proposition  (6*9)#  then  for  all  [u^]  and  [u^]  €  F  with 
d(  [ujjv])  <  k,  d(tu2],[v])  <  k,  o(  [u.^,  [v])  <  Q  and  a(  [UgMv])  <  Q, 
we  have  d(  [x(  [oj )],  [x(  [ug] )] )  <  Po(  [u.^,  [Ug] ) . 

Proof  of  Proposition  6.11.  By  applying  Proposition  (6.10)  to  [u]  =  [u^] 
we  obtain  [u^]  €  F*([v],k),  and  by  applying  Proposition  (6.9)  to 
[wx]  =  [u2]  and  [wg]  =  [Ug]  we  obtain  d(  [x(  [U.J  ) ],  [x(  [Ug] ) ] )  < 

P  a( [u^], [Ug] ) .  This  concludes  the  proof  of  Proposition  (6.1l). 
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Proposition  6.12.  If  Q  is  the  positive  constant  introduced  in 
Proposition  (6<9)>  then  there  is  a  positive  number  K  such  that  for 
all  [Ul]  and  [u2]  e  F  with  d(  [tx^] ,  [v] )  <  k,  d(  [u2],  [v])  <  k, 
o(  [uj,  [v] )  <  Q  and  a(  [u2],  [v] )  <  Q,  we  have 

d([y([u1],[v])],[y(fu2],[v])])  <Ko(  [u^jug])  (6.85) 

Proof  of  Proposition  6.12 .  By  definition  (see  (2.8)),  we  have 

y(t;[u],[v])  =  G(t; [v])(x(t; [u])-  x(t;[v])) 

for  all  t  e  [t  ,t,  ]  (6.86) 

a  D 

From  Proposition  (6.5)*  we  have 

| G( t;  [ v] )  |  =  | G(  tj  [v]  ,0)  |  <  M  for  all  t  e  [t^t^]  (6.87) 

From  relation  (6.86),  we  have 

yCtjt^JJv])  -  y(t;[u2],[v])  =  G(t;[v])(x(tj[u1])  -x(tj[u2))) 

for  all  t  e  [t  .t^]  (6.88) 

From  relations  (6.87)  and  (6.88),  we  have 

d([y([Ul],[v])],[y([u2],[v])])  <  Md(  [x( [uj ) ], [x( [u2] ) ] )  (6.89) 
But  by  Proposition  (6.11),  we  have 

d([x([u1])],[x([u2])])  <  Podu^tug])  (6.90) 
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From  (6.89)  and  (6. 90),  we  then  have 


d(  [y([u1],[v])],[y([u2],[v])]) 

<  PM  o(  [u^l^Iug])  ■Kff([u1],[u2l)  (6.91) 

This  concludes  the  proof  of  Proposition  (6.12). 

All  the  results  of  this  section  which  we  shall  need  later  on  can  be 
summarized  as  follows: 

Proposition  6.13*  If  [v]  €  F*,  £  is  the  positive  number  associated 

to  [v]  in  the  definition  of  E,  k  is  an  arbitrary  positive  number, 
then  there  exist  two  positive  numbers  K  and  Q  such  that  for  all 


[uj  and 

[Ug]  e  F  with 

U) 

ddi^Mv])  and 

d( [u2], [v])  <  k 

(6.92) 

(ii) 

0(^1,  tv])  and 

a( [ug], [v])  <  Q 

(6.93) 

we  have 

(i) 

[u.,]  and  [ug]  € 

F* 

(6.94) 

(ii)  d([y([u1],[v])],[y([\o2],[v])])  <  Kc  ( [u^tug])  (6.95) 

Application  to  a  linear  system 

We  shall  assume  that  f(x,u,t)  has  the  particular  form  given  in 
relation  (2.15),  namely 

f(x,u,t)  *  A(u,t)x  +  Cp(u,t)  (6.96) 
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The  results  obtained  in  this  section  in  the  case  of  a  nonlinear 
system  can  a  fortiori  be  applied  to  a  linear  system.  We  want  to  show 
here  how  these  results  could  be  obtained  directly  in  the  case  of  a 
linear  system. 

Since  for  any  [u]  e  F  the  coefficients  of  Equation  (6.96)  are 
measurable  and  bounded,  we  know  that  a  solution  [x([u])]  will  always 
exist,  i.e.,  [u]  €  F*.  In  other  words,  we  have 

F  =  F*  (6.97) 


Hence  the  question  of  existence  of  a  solution  for  a  particular 
[u]  €  F  which  was  the  main  difficulty  in  the  treatment  of  the  general 
nonlinear  system,  given  in  the  beginning  of  this  section,  is  trivially 
solved  in  the  case  of  a  linear  system.  To  complete  this  direct  study 
of  a  linear  system,  we  shall  perform  seme  algebraic  manipulations  and 
prove  the  existence  of  uniform  bounds  for  some  constants  associated  to 
each  element  of  F. 

The  study  of  a  linear  system  is  particularly  simple  because  the 
matrix 


D(t;[w],z) 


OX 


x=x(t;[w],z) 


=  A(w(t),t)  for  all  t  [t^t^]  (6.98) 


is  independent  of  z.  From  there  follows  that  the  matrix  G(t;[w],z) 
will  also  he  independent  of  z  and  we  shall  write  G( t;  [w] )  instead 
of  G(t;[w],z).  Proposition  (6.7)  can  now  he  stated 
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(6-99) 


.  3LxgL.il 


and  from  the  relation 


x  =  x(tj[w],z) 


defining  the  mapping 


^_1(  [w] )  :  Z([w])  X  [ta,tfe]  -»  X  X  [t^t^] 
it  follows  that 


x  =  x(t;[v],0)  +  G  1(t;[v])z 


But  by  definition 


x(t;[w],0)  =  x(tj[w]) 


hence 


x  =  x(  t; [w] )  +  G  1(t;[w])z 


i.e., 


z  =  G(t;[w])(x  -x(t;[w])) 


If  we  compare  relation  (6.104)  with  the  relation 


y  =  G(t; [w])(x  -  x(t; [w] ) 


defining  the  mapping  (see  Section  2) 


(6.100) 


(6.101) 


(6.102) 


(6.103) 


(6.104) 


(6.105) 


(6.106) 
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»(M)  :  XX  -*Y([w])  X 


(6.107) 


we  obtain 


*([vl)  ■  $(M) 


(6.108) 


i  •  e  *  f 


Z([w])  X  [ta,tbl  =  Y( [w] )  X  [ta,tb]  (6.109) 


and  in  particular 


UUuUv])]  =  [y([u],[w])]  (6.110) 

For  a  linear  system  Propositions  (6*9)>  (6. 10),  (6.11)  and  (6.12) 
are  combined  into  the  following  result: 

Proposition  6.14.  For  any  positive  number  k,  there  is  a  number  P 
such  that  for  all  [u]  and  [w]  e  F^  we  have 

d([y([u],[w])])  <  Pcr(  [u],[v])  (6.111) 

Proof  of  Proposition  6.l4«  For  any  positive  number  k  there  exist 
uniform  bounds  for  each  of  the  expressions  on  the  right  side  of  the 
differential  equation  (2*26),  hence  there  exists  a  P  such  that 

ly(t;[uj,(w))|  <  P 

for  a.e.  t  e  [t  ,1^],  all  [u]  and  [w]  e  (6.112) 
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Moreover, 


y(ta;tu],[v])  =  0  (6.113) 

and 

y(t;[u],[w])  =  0 

for  all  t  c  [t  ,t  ]  such  that  u(t)  =  w(t)  (6.11U) 

a  d 

From  relations  (6.112),  (6.113)  and  (6.114)  we  immediately  have 

d([y([u],[v])])  <  Pa  ( [u],[w])  (6.115) 

This  completes  the  proof  of  Proposition  (6.14). 

Application  to  a  linear*  system 

If  we  assume  that  the  system  is  linear*,  i.e.,  if  the  function 
f(x,u,t)  has  the  particular  form  given  in  relation  (2.16): 

f(x,u,t)  =  A(t)x  +  <p(u,t)  (6.116) 

the  treatment  given  above  for  linear  systems  can  be  further  simplified, 
since  the  matrices  D(t;[w])  and  G(t;[w])  are  independent  of  [w] 
in  the  case  of  a  linear*  system.  We  shall  write  G(t)  instead  of 
G( t; [w] ) . 

The  differential  equation  for  [y([u],[w])]  becomes 

y(t;[u],[w])  *  G(t)(cp(u(t),t)-<p(w(t),t)) 

for  a.e.  t  €  [t  ,1^]  (6.117) 
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and  may  be  directly  integrated  to  give 


y(t;[u],[v])  =  /t  G(T)(q3(u(T),T)-cp(w(x),T))dT 
t 

a 

all  t  e  [ta,tb]  (6.118) 

which  implies 

x(t;[u])-x(t;[w])  =G_1(t)/t  G(-r)(cp(u(T),T)-<p(w(T),T))dT 

t 

a 

all  t  €  (6.119) 

The  results  of  Proposition  (6. 17)  may  then  be  immediately  read  off  from 
relation  (6.118). 

Relation  (6.119)  could  have  been  immediately  derived  from  the 
theory  of  nonhanogeneous  linear  differential  equations .  Indeed,  we 
have 

(x(t;[u])-x(t;  [w])) #  =  A(  t)(x(  t,  [u]  )-x(  t;  [w] )  )+cp(u(  t),t)-<p(w(  t)  ,t) 

for  a.e.  t  €  [t  ,t  ]  (6.120) 

a  d 

and  relation  (6. 119)  is  the  well-known  solution  of  the  differential 
equation  (6.120)  for  the  initial  condition 

_  j  [w] )  =  0  (6.121) 

o  a 
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SECTION  7 


Approximation  of  the  Comparison  Trajectories  in  the 
Comoving  Space  along  a  Given  Trajectory 

In  "([v])  X  [t  ,t.  ],  the  comoving  coordinate  space  along  the 
trajectory  [x([v])],  we  have  for  every  [u]  €  F*  a  trajectory 
[y(  [u],[v])]  which  is  the  image  of  the  trajectory  [x([u])]  and  a 
trajectory  [y([u],[v])]  which  is  a  certain  approximation  of  the  tra¬ 
jectory  [y( [u], [v] ) ] .  In  this  section  we  shall  study  the  properties 
of  this  approximation.  More  precisely,  this  section  will  be  devoted  to 
the  proof  of  Proposition  (7*3)  in  which  we  give  an  upper  bound  for  the 
uniform  distance  between  [y([u],[v])]  and  [y([u],[v])]  as  a  function 
of  the  distance  between  the  two  control  functions  [u]  and  [ v]  when 
this  distance  is  measured  with  respect  to  the  norm  a. 

We  remind  the  reader  that  the  results  of  this  section  are  highly 
trivial  when  the  system  is  linear*,  since  in  that  case  we  already  know 
that  [y([u],[v])]  s  [y([u],[v])]  (see  Section  2).  When  the  system 
is  linear  the  situation  is  not  trivial  but  nevertheless  very  simple  as 
is  shown  in  a  paragraph  at  the  end  of  this  section. 

Proposition  7*1*  Let  F(t,x)  be  an  n-dimensional  vector-valued  function 

defined  for  all  t  €  [t  ,t,  ]  and  all  n-dimensional  vectors  x  with 

a  b 

lxl  <  where  T|  is  a  fixed  positive  number,  such  that 

(i)  F(t,x)  is  measurable  with  respect  to  t  for  all  |x|  < 

uniformly  equicontinuous  with  respect  to  x  and  uniformly 

bounded  for  all  t  €  [t  ,t  ]  and  all  |x|  <  r\  (7*1) 

a  d  — 


69 


(ii)  11m  I  =  o  uniformly  for  t  €  [t  ,t.  ]  (7*2) 

M-0  Ixl  a  6 

then  there  exists  a  function  G(r)  defined,  continuous  and  nondecreasing 
over  [  0,  T|  ]  such  that 


(1)  lim  -^  =  0  (7-3) 

r  -»  0 

t. 

(11)  /  b  |F(t,a(t))|dt  <  G(d([a]))  (7-4) 

t 

a 

for  all  bounded  measurable  n-dimensional  vector-valued  functions  [a] 
such  that  d(  [a] )  <  rj . 

Proof  of  Proposition  7*1»  Let 

G(r)  =  (t  -  t  )  sup  | F( t,x) |  for  all  r  €  [0,n]  (7-5) 

b  a  M  <r 

t  € 

By  construction  G(r)  is  continuous  and  nondecreasing  over  [0,ri]. 

We  also  have 

lim^-  =  0  (7-6) 

r-»  0 


since  we  have  assumed  that 

lim  1  _  o  uniformly  for  t  e  [t  ,t,  ]  (7*7) 

I x |  -»  0  1*1  a  b 
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Moreover, 


t, 

/  |F(t,a(t))  |dt  <  (t  -  t  )  sup  |F(t,a(t))| 

“tVV 


U|<  d(  [a] ) 


<  G(d([a])) 


(7-8) 


This  concludes  the  proof  of  Proposition  (7*1). 

Proposition  7  »2  »  Let  K(t,x,u)  be  an  n-dimensional  vector-valued 

function  defined  for  all  t  e  [t  ,  t,  ]«  all  n-dimensional  vectors  x 

ah 

with  |x|  <  tj,  all  r -dimensional  vectors  u  with  |u|  <6,  where  n 

and  b  are  fixed  positive  numbers,  such  that 

(i)  K(t,x,u)  is  measurable  with  respect  to  t  and  u, 
uniformly  equicontinuous  with  respect  to  x  and  uni¬ 
formly  bounded  for  all  t  c  [t  ,t,  ],  all  |xl  <  n 
and  all  |u|  <  6  (7-9) 


(ii) 

K(t,0,u)  =  0 

all  t  e  [t  ,t.  ],  all 
a  d 

|u|  <  6 

(7.10) 

(iii) 

K(t,x,0)  =  0 

all  t  €  [tg^t^],  all 

lxl  <  n 

(7.11) 

then  there  exists  a  function  H(r)  defined,  continuous  and  nondecreasing 
over  [  0,  t]  ]  such  that 

(i)  H(0)  »  0  (7-12) 

t. 

(ii)  /  |K(t,x( t),u( t) ) |dt  <  G(d( [x] ) )  c ( [u] )  (7-13) 

t 
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Proof  of  Proposition  7.2.  Let 


H(r)  =  sup  |K(t,x,u)|  (7.1*0 

t€  [ta,tb] 

|x|<  r 

|u|<  6 

Then  H(r)  is  continuous  and  nondecreasing  over  [0,n]  since  K(t,x,u) 

is  uniformly  equicontinuous  with  respect  to  x  for  all  t  €  [t  ,t  ] 

a  d 

and  all  |u|  <  6.  We  also  have 

H(0)  =  0  (7-15) 


since  we  have  assumed  K( t,0,u)  =  0.  Moreover, 


t  t 

/  |K(t,x(t),u(t)|dt  <  /D  |K(t,x(t),u(t))|dt 

t  t 

a  a 

u(t)  b  0 


<  / 


•S 


sup  |K(t,r,v)|dt 

t£[W 


u(t)  ^  0  lrl<d([xD 
I  v|  <  8 


<  /D  H(d( [x] ) )dt 
t 

a 

u(t)  to 


<  H(d([x])) a  ( [u] ) 


(7-16) 


This  concludes  the  proof  of  Proposition  (7*2). 
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Proposition  7*5»  If  [v]  c  F*  and  if  Q  is  the  positive  constant 
introduced  in  Proposition  (6.9)#  then  there  exists  a  function  g(r) 
defined,  continuous  and  nondecreasing  over  [0,Q]  such  that 

(i)  llm  4^=0  (7.17) 

r  -4  0 

(ii)  for  all  [u]  e  F*([v],k)  with  a([u],[v])  <  Q,  we  have 

d([y([u],[v])],[y([u],[v])])  <  g(cr([u],[v]))  (7 -3.8) 

Proof  of  Proposition  7»5*  By  definition,  we  have 

y(t;[u],[v])=  G(tj[v])(x(tj[u])-x(tj[v]))  all  te  [^,1^]  (7*19) 

By  differentiation  with  respect  to  t  of  relation  (7*19)>  we  obtain 
y(t;[u],[v]) 

=  G(t;[v])(x(tj[u])-x(tj[v])) 

+  G(t;[v])(x(t;[u])-x(t;[v])) 

=  -  G(t j [v] )D(t; [v])(x(t; [u])-x(tj [v])) 

+  G(tj[v])(f(x(t;[u]),u(t),t)-f(x(t;[v]),v(t),t)) 

=  G(tj[v])(f(x(t;[v]),u(t),t)-f(x(tj[v]),v(t),t)) 

+  P(tjx(t;[u])-x(t;[v])) 

+  K(t;x(t;[u])-x(tj[v]),u(t)-v(t))  a.e.  t  €  [tR,t^]  (7*20) 

where  the  functions  F  and  K  are  defined  by 
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F(tjx)  =  -G(t;[v])D(tj[v])x  +  G(t;[v])(f(x(tj[v])  +  x,v(t),t) 

-  f(x(tj[v]),v(t),t)  (7-21) 

K(t;x,s)  =G(tj[v])(f(x(t;  [v])+  x,v(t)  +  6,t)  -  f(x(t; [v] ),v(t)  +  s,t) 

+  f(x(t;[v]),v(t),t)  -  f(x(t;[v])  +  x,v(t),t))  (7.22) 

By  definition  we  have 

y’(t; [u], [v] ) =G(t;[v])(f(x(tj[v]),u(t),t)  -  f(x(t; [v]),v(t),t)) 

for  a.e.  t  €  [t  ,t,  ]  (7*23) 

a  d 

From  relations  (7*20)  and  (7*23),  we  obtain 
y(t;[u],[v])  -  y’(t;[u],[v]) 

=  F( t ;x(t; [u] )  ■  x(t; [v] ) )  +  K(tjx(t; [u] )  -  x(t; [v] ),u(t)  -  v(t))  (7-24) 

By  definition  we  also  have 

y(t „;[u],[v])  =  y(t  ;[u],[v])  =  0  (7-25) 

ct  cl 

Hence,  from  relations  (7*2*0  and  (7*25),  we  may  write: 

a(  [y(  tu],[v])],[y(  [u],[v])]) 

t.  t 

<  /  b|F(tjx(tj[u])-x(tj[v]))|dt+/  °|K(tjx(t;[u])-x(tj[v]),u(t)-v(t))|dt 

t  t 

a  a 

(7.26) 
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Let  us  first  estimate  the  integral 


t 

f°  |p(tjx(tj[u])-x(tj[v]))|dt  (7.27) 

t 


From  the  definition  of  F(t;x)  we  have 


F(t;0)  =:  0  (7.28) 


and 


Fx(t;0)  =-  G(t;[v])D(tj[v])  +G(t;[v])D(t;[v])  =0  (7-29) 


From  relations  (7*28)  and  (7.29)>  we  then  have 


lim  l£(-t|X)  [  =  o  (7.50) 

M  -»  0  |  x  | 


By  Proposition  (7.1)  there  then  exists  a  function  G(r)  defined,  con¬ 
tinuous  and  nondecreasing  over  [0,  such  that 

(i)  lim  =  0  (7.31) 

r -*  0  r 


t. 

(ii)  /b  |P(t;x(t;[u])- x(t;(v]))|dt  <  G(d( [x( [u] ) ], [x( [v] ) ] ) 
t 

a 

(7.52) 


From  Proposition  (6.11)  we  know  that 


d([x([u])Ux([v])])  <  Po([u],[v])  (7-33) 
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If  we  define  G^r)  *  G(Pr),  relations  (7*32)  and  (7.33)  may  then 
he  written 

t- 

/  b  |F(t;x(t;[u])-  x(t;[v]))|dt  <  G  ( o(  [u] ,  [v] ) )  (7*3^) 

t 

a 

By  definition  G^(r)  is  continuous,  nondecreasing  over  [0,Q]  and 
such  that 

G-  (r) 

lim  =  0  (7*35) 

r->  0 

Let  us  now  estimate  the  integral 

/b  |K(t;x(t;[u])  -  x(t;[v]),u(t)  -  v(t))|dt  (7*36) 

t 

a 

From  the  definition  of  K(t,r,s)  we  have 

K(t,x,0)  =0  for  all  t  c  [t  ,t,  ]  and  all  x  with  |x|  <  (7*37) 

a  d  c 

and 

K(t,0,s)  =0  for  all  te  [t  ,t  ]  and  all  s  with  |s|  <  k  (7*38) 

a  d 

Hence  by  Proposition  (7*2),  there  is  a  function  H(r)  defined, 
continuous  and  nondecreasing  over  [0,  ^  ]  such  that 

(i)  H(0)  =  0  (7-39) 
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<  H(d([x(  [u])],[x([v])])a(  [u],[v]) 


(7A0) 


From  Proposition  (6.11),  we  know  that 


<1(  [x([u])],[x(  [v])])  <Po([u],[v])  (7-^1) 


If  we  define  G2(r)  *  rH(Pr)  then  relations  (7A0)  and  (7***l) 
may  be  written: 

/b  |K(t;x(t;[u])  -  x(t;[v]),u(t)  -  v(t)) |dt  <  G  (a( [u],[v]))  (7 .42 ) 
t 

a 

By  definition  G^(r)  is  continuous,  nondecreasing  over  [0,Q]  and 
such  that 


<Ur) 

lim  — -  =  0 

r->  0  r 


(7**0) 


Combining  relations  (7*26),  (7*3*0  and  (7***2)>  we  obtain 
d([y([u],[v])],[y(tu3,[v])])  <  G^duMv]))  +02(a([u],M))  (7*^) 


We  define 


g(r)  =  G^r)  +  Gg(r) 


(7-45) 


From  (7*35)  and  (7*^3)  it  follows  that  the  function  g(r)  is  contin 
uous,  nondecreasing  over  [0,Q]  and  such  that 


lim  =  0  (7^6) 

r  -*  0  r 

Moreover,  combining  relations  (7*44)  and  (7*45),  we  obtain 

<i(  [y(  [u],  [v] ) ],  [y(  [u],  [v] )  ] )  <  g(a(  (u], [v]))  (7.^7) 

This  concludes  the  proof  of  Proposition  (7*3)* 

Application  to  a  linear  system 

We  shall  again  consider  the  linear  system 

f ( x,u, t )  -  A(u,t)x  +  cp(u,t)  (7*48) 

introduced  in  Section  2  and  show  how  the  results  of  this  section  could 
be  obtained  directly  for  a  linear  system. 

In  Section  2  we  have  derived  the  following  relations: 

y(t;[u],[v])  -y'(tj [u], [v] ) 

=  G(tj[v])(A(u(t),t)  -A(v(t),t))G‘1(t;[v])y(t;[u],(v]) 

for  a.e.  t  e  (7»^9) 

y(ta;[u],[v])  =  y(t&;[u],[v])  =0  (7-50) 

We  know  that  the  coefficient  G(tj  [v])(A(u(t),t)-A(v(t),t))G  1(t;[v]), 
occurring  in  the  right  side  of  Equation  (7-^9)»  is  uniformly  bounded  by 
some  constant  V  for  all  [u]  and  [v]  €  F^,  and  we  have  proved  in 
Proposition  (6.14)  that  there  exists  a  positive  constant  P  such  that 
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d([y([u],[w])])  <  Po(  [u],[w]) 


(7-51) 


From  relations  (7-^9)  and  (7 .51),  it  follows  that 

|y(t;[u].[v])  -y‘(t;[u],[v])|  <  VP  o(  [u],  [v] ) 

for  a.e .  t  e  (7.52) 

Moreover,  we  immediately  see  from  relation  (7.49)  that 

y(t;[u],[v])  -  y‘(t;[u],[v])  =0 

for  all  t  e  [t  ,t  ]  such  that  u(t)  =  v(t)  (7*55) 

a  d 

If  we  write 

P*  =  PV  (7.54) 

then  from  relations  (7-50),  (7-52),  (7-53)  and  (7.5*0,  we  immediately 
obtain  the  following  result: 

Proposition  7*4.  If  k  is  a  positive  real  number,  then  there  is  a 
positive  constant  P*  such  that 

d(ty([u],[v])],[y(tuUv])])  <  P*(a(  [u],  [v] )  )2 

for  all  [u]  and  [v]eFk  (7<55) 

In  the  case  of  a  linear  system,  Proposition  (7-*0  implies 

O 

Proposition  (7*5)  since  the  function  g(r)  =  P*r  satisfies  the  con- 

P 

ditions  of  Proposition  (7*5):  P  r  is  continuous,  nondecreasing  for 

positive  r  and  lim  -  *  0. 

r  -»  0  r 
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SECTION  8 


The  Range  of  a  Vector  Integral  over  Borel  Sets 

In  this  section  we  shall  derive  some  properties  of  the  set  H([v]). 

In  Proposition  (8.11)  we  shall  prove  that  the  set  H([v])  is  convex. 

This  result  will  be  used  later  in  the  proof  of  Theorem  IV.  Let 
+ 

r  :  F  ->H([v])  be  the  mapping  which  maps  an  element  [u]  of  F  into 
+  + 

the  element  y(tfe;  [u],  [v])  of  H([v]).  In  Proposition  (8.13)  we  shall 
prove  that  if  0  is  an  interior  point  of  the  set  H([v])  then  there 
is  a  subset  F([v])  of  F  such  that  0  is  also  an  interior  point  of 
the  image  of  F([v])  under  the  mapping  T  and  such  that  the  restric¬ 
tion  to  F([v])  of  the  mapping  T  has  a  continuous  inverse.  We  shall 
need  Proposition  (8.13)  in  the  proof  of  Theorem  III. 

We  shall  assume  that  the  reader  knows  the  basic  elements  of  the 
theory  of  measure  which  can  be  found  in  HaLmos1  book,  "Measure  Theory". 

First  let  us  recall  some  classical  notations  and  definitions: 

1.  If  A  is  a  set,  CL  a  a-algebra  of  subsets  of  A,  m-  a  non¬ 
negative  measured  defined  on  with  \i{k)  <  +«,  then  (A,  Q.  ,n)  is 
called  a  measure  a-algebra. 

2 .  An  element  B  e  Ci  is  called  an  atom  of  the  measure  o-algebra 
(A,  d  ,\x)  if  n(B)  b  0  and  if  dC  B  with  D  €  implies  either 

n(D)  =  o  (8.1) 

or 

\x(D)  *  \x{3)  (8.2) 
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3.  A  measure  a-algebra  (A,  G[  (i)  is  called  nonatomic  if  it  has 
no  atom. 

Proposition  8.1.  If  (A,  (%  ijl )  is  a  nonatomic  measure  a-algebra, 

B  e  0^}  C( g  =  (D  i  D  c  DC  B}  then  (B,  is  a  nonatomic 

measure  a-algebra. 

Proof  of  Proposition  8.1.  Let  Z  ^  D  be  the  minimal  a-algebra 

containing  (L  *  We  have  by  construction  Z  _  C  Z ,  moreover, 

D  €  implies  DC  B,  hence  Z  C  In  other  words, 

B  B  B 

=  #B  and  (B,  #B,d)  is  a  measure  a-algebra.  Finally, 

(B,  )  is  nonatomic  since  any  atom  of  (B,  Z ',\±)  is  also  an 

B  B 

atom  of  (A,  $  ,\±) .  This  concludes  the  proof  of  Proposition  (8.1). 

Proposition  8.2.  If  (A,  Z  ,\i)  is  a  nonatomic  measure  a-algebra, 

Za  =  {B  :  B  €  Z,  u(B)  =  au(A)),  then  there  exists  a  nest^C 
such  that  ^  D  da  h  0  for  all  a  €  [0,1] . 

Proof  of  Proposition  8.2.  Let  f/  be  a  maximal  nest  in  Z*  Such  a 
next  exists  by  the  Hausdorff  Maximal  Principle.  Let  ^  Pi 

““  Sl"Ce 

0  e  similarly  ^  0  since  A  e  Let  N+  =  PlB  +  , 

Be^a 

N  =  U  B  .  Since  ^7  is  a  maximal  nest  over  the  a-algebra  (\  , 
u 

tten  »'  e  f{,  l£«  ?[,  «(»')  <  a,  n(N^)  >  a,  N“  C  N*  and  there 

is  no  N  €  such  that  N  ^  N L,  N  £  ft+ ,  ?’*  C  K  C  N^.  Moreover, 

u  u  u 
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we  have  n(NQ )  =  |i(N*  )  =  a.  Otherwise,  from  the  nonatomicity  of 

(A,  #  ,ti),  there  would  he  a  subset  K  of  N*  ~  N  such  that  n(K)  0  0 

cx  cx 

and  u(K)  h  u(N*  ~  n"  )  =  n(N*)  -  n(N"),  i.e.,  such  that  NjUK  0  n’, 

N’UK  ^  N*  and  N^C  N^UkC  N*  which  contradicts  our  previous 

results.  For  every  a  €  [0,1]  we  have  exhibited  elements  N  and  N* 

CZ  cz 

in  fl  =  7lC\  $  >  This  concludes  the  proof  of  Proposition  (8.2). 

'  cx  ^  cz 

Proposition  8.3*  If  (A,  $  ,p)  is  a  nonatomic  measure  o-algebra,  then 
there  exists  a  set  £  -  {Da  :  a  e  [0,1]}  such  that 

(i)  Da  e  C[  all  a  e  [0,1]  (8.3) 

(ii)  n(Da)  -  an  (A)  (8.4) 

(iii)  Da  C  Da  if  and  only  if  a±  <  ag  (8.5) 

Proof  of  Proposition  8.3.  For  every  a  e  [0,1]  let  be  an  element 

of  the  nonempty  set  fl .  Such  D  exists  by  the  axiom  of  choice. 

^  CZ  cz 

The  conditions  (i),  (ii)  and  (iii)  are  then  satisfied  by  construction. 
This  concludes  the  proof  of  Proposition  (8.5)* 

Proposition  8 A.  If  (A,  ,^)  is  a  nonatomic  measure  c-algebra, 

(ai  :  i  =  1,  . . .  ,  k)  a  finite  set  of  nonnegative  real  numbers,  then 


there  exists  a  set  [A^  :  i  =  1,  . . .  ,  k)  such  that 

u) 

Aj  c  ^  for  i— 1,  ...  ,  k 

(8.6) 

(ii) 

Ai ^  Aj  =  0  for  i  =  1,  . . .  ,  k;  J  =  1,  . . .  ,  k 

and  i  t  J 

(8.7) 
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(ill)  U  A  =  A 

1-1  1 

(8.8) 

a. 

(iv)  \i(A  )  =  -  u(A) 

(8.9) 

l  a 

J=1  J 

Proof  of  Proposition  8.4.  Let 

ai 

a  -  k  for  1=1,  . . .  ,  k 

(8.10) 

J*i  0 

By  Proposition  (8.3)  there  is  a  ^  €  C(  such  that 

^(A1)  =  a^A) 

(8.11) 

Let 

A^^  =  A  ~ 

(8.12) 

and 

(8.13) 

By  Proposition  (8.1),  (A^^,  is  a  nonatomic  measure  a-algebra 

and  by  Proposition  (8.2)  there  is  a  €  Q  such  that 

^Ag)  =  Og|i(A(l))  (8.U0 

Let 

and 


I 

II 

s 

(8.15) 

^2  =  \{2) 

(8.16) 

) 
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By  Proposition  (8.1),  (A^,  Q ^ ,\i)  is  a  nonatomic  measure  o-algebra 

and  by  Proposition  (8.2)  there  is  a  ^  such  that 

n(Aj)  =a5n(A(2))  (8.17) 

By  repeating  k  -  1  times  the  same  process,  we  obtain  a  set 
(A^  :  i  =  1,  . . .  ,  k)  .  It  is  a  trivial  matter  to  verify  that  the  set 
(Ai  :  i  =  1,  . . .  ,  k)  satisfies  conditions  (i)  to  (iv).  This  concludes 
the  proof  of  Proposition  (8.4). 

Proposition  8.5*  If  (A,  5  ,n)  is  a  nonatomic  measure  o-algebra,  f 

a  bounded  and  ©  measurable  function  over  A,  there  is  a  set  D  €  2? 

with  \i(  D)  =  £u(A),  /  f  du  =  f  d*i. 

D  A 

Proof  of  Proposition  8.5*  Fran  Proposition  (8.2)  there  is  a  subset 
^=(Da  :  a  €  [0,1])  of  S  such  that 

U)  n(Da)  =  an(  A  ) 

(11>  \C\  ltt 

Let  Ba  “  Da  ~  all  Cl  e  hence  BQ  e  ^  and 


We  then  have 


*(\)  +  «(Bi)  =  1 


(8.18) 


since  (G^)  -  ®( Dj) )+  (®(D^)  -®(  DQ) )=  (l -  £)  +  (■§•- 0)  =1.  Moreover, 


®(Bg)  is  continuous  over  a  e  [£,1] 


(8.19) 


since 


|*(B  )  -  *(B  )|  <  UKD^)  -*(De^_  |))  -  (*(D^)-  *(D^.  |))| 

<  |*(D  )  -*(D  )|  +  ^(D^.  |)-*(D^.  ^)| 

<  M|n(D^)  -h(Dq^)|  +  M|h(D(^.|)-h(Dc^.  j)| 

<  Mla^-  a2|  +m|(«l-  J)  -  (ag-  £)|  =  2M|  o^-  a2| 

From  (8.19)  we  know  that  *(B^  ^  =  (®(Ba)  :  a  e  [£,l]}  is  a  segment. 
Moreover,  *(B[£  1]^  contains  ^(B.^)  and  ®(B^)  =  1  -  ®(B1),  from 
(8.18),  hence  ®(B^  1j)  contains 

*(B1)+  *(B^)  *(B1)+ (1- *(B1)) 

=  5  =  ? 


since  a  segment  is  a  convex  set. 

Let  a  €  [£,l]  be  such  that  <&(B-)  =  £  then  D  =  B -  is  the 
requested  set. 
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and 


(8.20) 


If  /  f  dii  «  0  let  *(s)  *  /  f  du  then 
A  s 

®(B1)  +  O(B^)  =  0 

$(Bg)  is  continuous  for  a  e  [*£,1]  (8.21) 

for  the  same  reasons  as  (8.18)  and  (8.19)* 

Fran  (8.20)  and  (8.21),  we  conclude  as  before  that  there  is  a 
a  e  [£,l]  such  that  #(B^)  =  0.  This  ends  the  proof  of  Proposition  (8.5)* 

Proposition  8.6.  If  (A,  $,u)  is  a  nonatanic  measure  a-algebra,  f  a 
uniformly  bounded  and  G  measurable  function  over  A,  $  the  minimal 
a-algebra  over  *  (Di  :  i  *  0,  1,  2,  ...  ,  }  where  the  sets 
have  the  following  properties: 


o° 

II 

> 

(8.22) 

Dnc  t> 

(8.23) 

D2n+1  ^  D2n+2  =  ^ 

(8.2lf) 

D2n+1 ^  D2n+2  =  Dn 

(8.25) 

^ =  *l(D2n+2)  =  * 

(8.2 6) 

/  fd n  =  /  f  ai*  -  i  /  f  dn 

4n+l  \n+2  Dn 

(8.27) 

then 

/  f  dn  -  /  f  dn  for  all  D  c  ^  (8.28) 

D  A 
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Proof  of  Proposition  8.6.  Let 


xi(c>  ■  41  <8-29) 

UD)  =  /  f  d»i  (8.30) 

*  D 

We  then  have  to  prove  that  X^D)  -  X2(D)  for  a11  ®  €  ^  *  We 

may  assume  without  loss  of  generality  that  /  f  d|i  >  0*  Indeed,  the 

A 

case  /  f  d|x  <  0  can  be  reduced  to  the  case  /  f  d^i  >  0  by  intro- 
A  A 

ducing  the  function  f *  »  -  f . 

1*  The  proposition  is  true  for  each  _in 

If  k  is  a  nonnegative  integer  then  there  is  a  unique  sequence  of 
nonnegative  integers  k0>  *1  ,  . . .  ,  k^  with 

k0  =  0  (8.31) 

kl+l  =2ki+1  or  k1+i  =2ki+2»  i  =  0,  1,  ...  ,  n-l  (8.32) 
kn  =  k  (8.33) 

hence 

/  f  d|i  =  £  /  f  d+i  (8. 3*0 

\+l  \ 

and 

n(ix  )  =  £  H(D  )  for  all  i  =  0,  ...  ,  n-l  (8.35) 

Ki+1  Ki 

Relations  (8.3^)  and  (8.35)  imply 
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M(I4,  ) 

/  f  *  -  TTTd TT"  '  f  134 

ki+i  1  i 

all  i  =  0,  • • ♦  p  n-1 

(8.36) 

i«6*| 

4(Dk  ) 

/  f  *  -  OTTT  J  f  * 
°k  X  \ 

n  o 

(8.37) 

or 

H(D.) 

*  ’  RaT  J  f  * 

all  D,  e  •& 
k 

(8.38) 

and 

MV  -vv 

all  Dk  €  <& 

(8.39) 

Hence  the  proposition  is  true  for 

each  e 

2 .  The  proposition  is  true  for  each  D  in  the  minimal  al«ehra 

<=#- 

over 

If 

D  € 

(8.40) 

then 

k 

D  =  U  D 

n 

(8.41) 

with 

D  O  D  ■  0  for  1  ^  J  (8.42) 

ni  nJ 
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From  definition  (8.29)  and  relations  (8.1*1)  and  (8.1*2),  we  obtain 

X  (D)  =  £  \  (D  )  (8.1*3) 

i=l  x  i 

Similarly,  from  definition  (8.30)  and  relations  (8.4l)  and  (8.42),  we 
obtain 

Xp(D)  =  £  Xp  (D  )  (8.1*1*) 

2  i=l  2  ni 

From  relations  ( 8 . 39) >  (8.43)  and  (8.44),  it  follows  that 

XX(D)  =  Xg(D)  (8.45) 

Hence  the  proposition  is  true  for  each  D  € 

3.  The  proposition  is  true  for  each  D  e  0^ 

The  position  is  the  following: 

(a)  and  X^  are  two  set-valued  functions  defined  over  the 
a-algebra  23  . 

(b)  X1  is  a  measure  over  23  since  (j  is  a  measure  over  $ 

(c)  X^(D)  *  Xg(D)  for  all  D  ,  where  &  is  a  subalgebra 

of  e. 

Since  Q  ,  the  a-algebra  generated  by  the  algebra  J,  is  by- 
construction  a  subalgebra  of  the  a-algebra  0  ,  it  follows  from  the 
theorem  on  the  uniqueness  of  the  extension  of  a  measure  that 
X^( D)  =  Xg( D)  for  each  D  €  (K •  (See  Halmos,  Measure  Theory,  page  51*)  • 
This  concludes  the  proof  of  Proposition  (8.6). 
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Proposition  8.7.  If  (A, 8  ,Li)  is  a  nonatomic  measure  a-algebra,  f 
a  bounded  and  ©  measurable  function  over  A,  then  there  is  a  non¬ 
atomic  measure  a-algebra  (A,  ,\l)  with  3  such  that 


/  f  dn  =  m  J  f  dy  for  all  D  €  &  (8.46) 

D  A 


Proof  of  Proposition  8.7*  Let  us  call  D  =  A  and  SD-  3  .  Let 

o 

P^  be  the  following  procedure: 


If  (Dn,  5^  ,u)  is  a  nonatomic  measure  a-algebra,  then  by 
n 

Proposition  (6.1)  there  are  sets  E^n+l  and  D2n+2  € 

n 

such  that 


D2n+1 n'  D2n+2  ”  ^ 


D2n+1  ^  D2n+2  Dn 


tl(r,2n+l)  = 


2n+2 


/  fdn=/  f  dji  -  ^  /  f  dn 


2n+l 


^2n+2 


(8.47) 

(8.48) 

(8.49) 

(8.50) 


*”d  <%»!>  \ntl'  *>  “a  <Dan«-  “> 

measure  a-algebras  by  Proposition  (8.1). 


are  nonatomic 


Let  us  apply  recurrently  Pr  to  (D^,  ©D  >  |a)  for  n  «  0,  1,  2,  ...  . 

n  n 

This  is  possible  since  Dq  and  5^  are  given.  Let  C\  be  the  mini- 

o 

mal  a-algebra  generated  by  (D^  :  i  =  0,  1,  2,  . . .)  .  By  construction, 
(A,  $,^i)  is  a  nonatomic  measure  a-algebra  and  Moreover,  ^ 

satisfies  the  assumptions  of  Proposition  (8.6),  hence 
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I'*- 


for  all  D  €  <Z  (8.51) 


This  concludes  the  proof  of  Proposition  (8.7). 

Proposition  8.8.  If  (A,5,|i)  is  a  nonatomic  measure  a-algebra, 
f  =  (f1,  ...  ,  fn)  an  n-dimensional  vector-valued,  bounded  and 
£$  measurable  function  over  A,  there  is  a  nonatomic  measure  o-algebra 
(A,  aCB  such  that 


/  f  dH  =  /  f  dn  for  all  D  €  (8.52) 

L  A 

Proof  of  Proposition  8.8.  Let  us  apply  Proposition  (8.7)  to  f*'  over 
(A,  $ ,\x)  and  let  (A,  C  6  be  the  nonatomic  measure 

a-algebra  so  obtained.  More  generally,  let  us  apply  for 
i  =  1,  2,  ...  ,  n-1,  Proposition  (8.7)  to  fi+1  over  (A,^,n)  and 
let  (A,  ^+1,u),  tfi+1  C  be  the  nonatomic  measure  a-algebra  so 
obtained. 

Let  <z-  a .  We  then  have 


/  f1  d|i  =  *4$-  /  f1  dn,  i=l»  •••  >  n  for  all  D  e  (2  (8.53) 

D  ]  A 

i.e., 

/  f  /  f  dp,  for  all  D  €  (X  (8. 54) 

D  ^ }  A 


This  concludes  the  proof  of  Proposition  (8.8). 
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Proposition  8.9«  (Lyapounov's  Theorem)  If  (A,#,n)  is  a  nonatomic 
measure  c-algebra,  f»  . ..  ,  fn)  an  n-dimensional  vector¬ 

valued,  hounded  and  23  measurable  function  over  A,  then 

R  -  (/  f  dn  :Be5)  (8.55) 

D 


is  convex. 

Proof  of  Proposition  8.9*  It  is  enough  to  prove  that  if  b^  and 
b2  e  R,  then 


{ab^  (l  -  a)b2  :  a  €  [0,1] }C  R  (8.56) 

If  b^  and  b^  e  R,  then  there  are  and  e  3  such  that 


/  f  du  =  b  for  i  =  1,  2  (8.57) 

Bi 

Let 

f  =  fX(Bi)  for  i  =  1,  2  (8.58) 

and 

f*  =  (f^,f^,  ,  fj,  ...  ,  f”)  (8.59) 

If  we  apply  Proposition  (8.8)  to  f*  over  (A,  3,n)>  we  obtain 
a  nonatomic  measure  a-algebra  (A, ^,fi),  5 •  Let  <&= 

(Dq  :  a  e  [0,1])  be  a  subset  of  O,  such  that 
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all  a  €  [0,1] 


(8.60) 


h(dq)  =  a  (i  (A) 


£ 


exists  by  Proposition  (8.3). 


We  then  have 


OU  f  du  :  X  =  [DanB1]U[(A~Da)OB2],  a  e  [0,1]}  (8.6l) 

X 

=  {/  f  dn  +  /  f  du  :  a  €  [0,1])  (8.62) 

D  X  A~D 
a  a 

^(Dj  u(A~  D  ) 

=  {I^Ar  {  fl  *  +  UJT~  l  f2  ^  :  a  €  C°'l])  (8,65) 

=  {a^  +  (l-a)b2  :  a  e  [0,1]}  (8.64) 

This  concludes  the  proof  of  Proposition  (8.9)* 

Proposition  8.10.  If  (A,  T5 ,\x)  is  a  nonatomic  finite  measure  a-algebra, 
X  is  an  n-dimensional  Euclidean  space,  S  is  a  class  of  bounded  3 
measurable  functions  from  A  to  X  such  that  f  and  g  €  S  implies 

fX(B)  +  gX(A  -  B)  e  S  for  all  B  e  S  (8.65) 

then 

L(S)  =  (/  f  *  ;  f  «  3)  (8.66) 

A 

is  convex. 

Proof  of  Proposition  8.10.  Let  f  and  g  €  S.  We  shall  prove  that 
there  is  a  set  L(f,g)  such  that 
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(f.) 

L(f,g)  is  convex 

(8.67) 

(ii) 

L(f,g)CL(S) 

(8.68) 

(iii) 

/  f  and  /  g  dn  €  L(f,g) 

A  A 

(8.69) 

The  existence  of  such  a  set  L(f,g)  is  a  sufficient  condition  for 
the  convexity  of  the  set  L(S). 

Let 


L(f,g)  =  U  (fX(B)  +  g*(A~B))d»i  :  B  €  5)  (8.70) 

A 


and 

L*(f,g)  =  {-/  g  du  +  a  :  a  c  L(f,g))  (8.71) 

A 


We  may  write  L*(f,g)  =  {/  (f-g)X(B)dn:  B  e5),  hence  L*(f,g)  is 

A 

convex  by  Proposition  (8.9)*  The  convexity  of  L(f,g)  then  follows 

from  the  convexity  of  L*(f,g).  This  proves  relation  (i). 

Relation  (ii)  is  an  immediate  consequence  of  the  definitions  of 

L(S)  and  L(f,g).  Moreover,  for  B  *  A  e  B  we  have  /  f  dp,  e  L(f,g) 

A 

and  for  B  *  0  t  $  we  have  /  g  dji  €  L(f,g).  This  proves 

A 

relation  (iii)  and  concludes  the  proof  of  Proposition  (8.10). 


Proposition  8.11.  The  set  H([v])  is  convex. 

Proof  of  Proposition  8.11.  From  the  definition  of  H([v])  we  have 

+  t, 

H(  [v] )  «  ir  <p(t,u(t);[v])dt  ;  [u]  c  F)  (8.72) 

t 


9^ 


where 


Cp(t,u(t)j[v])  *G(tj[v])(f(x(t;[v]),u(t),t)  -  f(x(tj[v]),v(t),t))dt 

(8.73) 

From  the  definition  of  the  class  F  we  know  that  if  [u^ ]  and 

[u_]  e  F  and  if  B  is  a  measurable  subset  of  [t  ,t,  ]  there  exists 
2  8.  0 

an  element  [u]  e  F  such  that 

u(t)  =  u^(t)  for  all  t  e  B 

=  Ug(t)  for  all  t  e  [ta,tt)]~B  (8.74) 

Hence  the  class  S*  of  all  functions  cp(t,u(t); [v] )  with  [u]  €  F 
satisfies  condition  (8.65),  and  we  may  apply  Proposition  (8.10).  This 
concludes  the  proof  of  Proposition  (8.11). 

Notations:  If  f  and  g  are  elements  of  the  class  S  introduced  in 
Proposition  (8.10),  we  shall  write 

o(f,g)  =  n( (t: f(t)  t  g(t),  t  €  A)) 

and 

d(f,g)  =  ess  sup  |f(t)-g(t)| 
t  e  A 

The  remarks  made  in  Section  5  also  apply  to, these  norms. 

Proposition  8.12.  Under  the  assumptions  of  Proposition  (8.10),  if 

/  f  dji  is  an  interior  point  of  L(S),  then  there  is  a  subset  S(fQ) 
A 

of  S  and  two  positive  constants  m  and  k  such  that 


(8.75) 

(8.76) 
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(8.77) 


(i)  d(fo,g)  <  k  for  all  g  in  S(fQ) 

(ii)  the  mapping  S(fQ)  -»L(S(fo))  is  one-to-one  (8.78) 

(iii)  /  f  dn  is  an  interior  point  of  L(S(f  ))  (8.79) 

A 

(iv)  a(g,h)  <  m  |  /  g  d*i  -  /  h  d|i|  for  all  g  and 
A  A 

h  €  S(fQ)  (8.80) 

Proof  of  Proposition  8.12. 

Notations;  If  A  is  a  set,  then  int  A  is  the  set  of  interior  points 

of  A,  co  A  is  the  convex  hull  of  A,  and  int  co  A  is  the  set  of 

interior  points  of  the  convex  hull  of  A. 

If  /  fQ  du  €  int  L(S),  then  there  are  functions  f^,f^,  ...  ,  f^+1 

in  S  such  that  2q  e  int  co  M  where  M  =  {2^  :  i  =0,1,2,  ...  ,  n+l} 

with  |  »  /  f  d|i  for  1*0,1,  ...  ,  n+l.  Let  M±  =  (M-{iiJ)  for 
A 

i  =0,1,2,  ...  ,  n+l.  For  every  2  e  co  M  let  X(2)  « 

(Xi(2)  :  1*0,1,  . ..  ,  n+l}  be  defined  by  the  following  rules: 

(i)  k  is  the  smallest  integer  such  that  2  €  co 

(ii)  Xk(2)-0 

(iii)  X(2)  -  {X^(2)}  are  the  barycentric  coordinates  of  2 
with  respect  to  M^. 

By  construction  there  exists  a  positive  constant  m  such  that 
n+l 

£  I X . ( J ‘ )  -  X.(l") I  <  m  |l’-  j"|  for  all  l'  and  i"  in  co  M 
i-0  1 

(8.81) 
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Let 

9  *  •••  »  •••  f  ^2#  •••  9  (8.82) 

Let  (A,£?,ii)  with  5  be  a  nonatomic  measure  c-algebra 

such  that 

/  qp  du  *  /  9  ^  for  a11  D  e  ^  (8.85) 

D  ™ A)  A 

Such  an  C(  exists  by  Proposition  (8.8). 

Let  (Di  :  i*0,l,  . ..  ,  n+l}  be  such  that 


(1) 

€  % 

(8.84) 

(ii) 

D^Dj  =  t>  if  i  b  J 

(8.85) 

n+l 

(iii) 

U  D  =  A 
i=0  1 

(8.86) 

(iv) 

=  n+2^(A) 

(8.87) 

Such  set  (D1  :  i  =0,1,2,  ...  ,  n+l}  exists  by  Proposition  (8.4). 

Let  J& ^  be  a  nest  :  Of  e  [0,1]}  C  such  that 

(i)  a  <  a  implies  D^C  1)^  (8.88) 

(ii)  Dj1  =  (8.89) 

(iii)  n(  D^)  =  a*i(Di)  (8.90) 

Such  a  nest  exists  for  all  i  =0,1,2,  ...  ,  n+l  by  Proposition  (8.3) . 

We  shall  use  the  following  convention:  if  a  is  a  positive  number, 
the  set  a  co  M  is  defined  by 

a  co  M  *  (iQ  +  a (i  -iQ):  l  e  co  M}  (8. 91) 
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Let 


f<i>  “foxvA~£i  Di 


n+l  X1((n+2)i)\  n+l  /  X1((n+2)l) 


n+i  /  K ,Un+z  l  K 

&V&  ) 


for  all  l  e  co  M  (8.92) 


By  construction,  we  have; 

n+l  . 

,  »  f  „  )  =  Z  lM*  )  "  M^") I 

(1  )  (i  )  1=0  1  1 

for  all  i'  and  l"  e  -^coM  (8-93) 


Let 


S(fo>  ■  {f(,)  > 

i  e  n1t2  co  M) 

(8.9*0 

We  have  S(fQ)C  S  since  f^j  £  S  for  all  l  e 

i 

n  +2 

co  M 

because  C  <%C  3  for  all 

i  ” 0,1,2,  • • *  ,  n+l 

by  Proposition  (8.2 ) 

We  also  have,  by  construction. 

£ 

II 

for  all  l€  nl2 

co  M 

(8.95) 

1  •  e  •  , 

L(S(fo))  « 

ST?  «°M 

(8.96) 

But  since 

l0  €  int 

^  coM 

(8.97) 
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then  relation  (8.96)  implies 


l0  €  int  L(S(fo))  (8.98) 

Relation  (8.98)  is  the  required  property  (8.79)  •  Property  (8.78) 
is  satisfied  by  virtue  of  relations  (8.9*0  and  (8.95).  Property  (8.77) 

follows  from  the  fact  that  the  class  S(f  )  has  been  constructed  from 

o' 

a  finite  number  of  bounded  measurable  functions.  From  relations  (8.8l) 
and  (8.93)#  we  have 

a(f  .  f  )  <  m|  i '  -  j"  |  for  all  1*  and  i"  e  — co  M 
(I  )  (1  )  "  n+2 

(8.99) 

Relation  (8.99)  is  equivalent  to  property  (8.80)  by  virtue  of 
definition  (8.9*0  •  This  concludes  the  proof  of  Proposition  (8.12). 

Proposition  8.13-  If  0  is  an  interior  point  of  H([v])  then  there 
is  a  subset  F([v])  of  F  and  two  constants  m  and  k  such  that 

(i)  d([u],[v])<k  for  all  [u]e7([v])  (8.100) 

(il)  the  mapping:  F([v] )  -»  (y^;  [u],  [v])  :  [u]  €  F(  [v] )} 

is  one-to-one  (8.101) 

(iii)  0  is  an  interior  point  of  {y(t^;[u]f  [v])  :  [u ]  eF([v])) 

(8.ioe) 

(iv)  o([u1),  [Ug])  <  | 

for  all  [u^ ]  and  [Ug]eF([v])  (8.103) 

Proof  of  Proposition  8.13.  Let  S*  he  the  class  of  functions  intro¬ 
duced  In  Proposition  (8.11).  By  replacing  S  by  S*  in  the  proof  of 
Proposition  (8.12),  we  immediately  obtain  Proposition  (8.13). 


99 


SECTION  9 


An  Application  of  Brouwer^  Fixed  Point  Theorem 

In  this  section  we  prove  a  single  proposition  which  will  play  a 
fundamental  role  in  the  proof  of  Theorem  III,  given  in  Section  10. 

Proposition  9«1»  If  f  is  a  continuous  mapping  of  a  hall  S ^  ■ 

{x  :  x  e  E,  | x  1  <  n),  with  r\  >  0,  of  the  Euclidean  space  E  into  E 
such  that  there  exists  a  function  g(r)  defined,  continuous  and  non¬ 
decreasing  over  [0,n]  and  having  the  following  properties: 

(i)  lim  =  0  (9-1) 

r->  0  r  .  .....  .  .  ■ 

(ii)  |a- f(a)|  <  g(  |a|)  for  all  a  e  Sn  (9-2) 

then  0  is  an  interior  point  of  the  set  f(S^)  image  of  the  set  S1"* 
through  the  mapping  f . 

Proof  of  Proposition  9.1.  Let  o  e  (0,nl  be  such  that  g(o)  <  ^  , 

i.e..  QbLsI  <  i.  Such  a  p  exists  since  is  continuous  over 

*  P  P 

(0,n]  and  lim  ^ELbI  s  q  by  assumption, 
o— »  0  0 

Let  h  (x)  =  z  +  x  -  f(x).  If  z  e  then  h  maps  S°  into 

z  z 

itself  since  for  x  e  Sp  we  have 

|bz(x)l  -  +  lx-  f(x)l  £  f  +  e(  M)  <  f  +  g(p)  <  f  +  §  =  P 

(9-3) 
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Moreover,  the  continuity  of  the  mapping  f  implies  the  continuity  of 

the  mapping  h  ,  hence  by  Brouwer's  Fixed  Point  Theorem,  there  exists 
z 

an  x1  such  that 

hz(xl)  =  ^  (9-*0 


i  .e . , 


z  =  f(xL) 


(9-5) 


o/2 

For  all  z  e  Sp/  there  exists  an  such  that  relation  (9*5)  holds, 

hence 

sp/2Cf(sB)Cf(s'1)  (9.6) 


The  point  0  is  then  an  interior  point  of  the  set  f(S^).  This 
concludes  the  proof  of  Proposition  (9*l)* 


Remark.  A  weaker  form  of  this  proposition,  corresponding  to 
2 

g(r)  =  Mr  for  some  0  <  M  <  +»,  has  been  introduced  without  proof 
in  a  previous  publication  [12].  An  elegant  proof  for  the  case 
g(r)  =  Mr  has  been  communicated  to  the  author  by  Dr.  G.  S.  Jones  of 
RIAS.  The  proof  given  here  for  the  larger  class  of  functions  g 
described  in  the  statement  of  Proposition  (9*1)  is  a  generalization 
of  Dr.  G.  S.  Jones*  proof. 
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SECTION  10 


Proofs  of  the  Theorems  of  Section  k 

Proof  of  Theorem  I*  If  x  =  x(t^j[v])  is  an  interior  point  of  the 
set  H,  there  is  an  £  >  0  such  that 

|  =  ( x^  t^j  [v] ), x2(  t^;  [v] ),..., x11  1(tb;[v]),xn(tb;[v])+£)  (10. l) 

is  also  a  point  of  the  set  H,  hence  there  is  a  [u]  in  F*  such 
that 

xO^tu])  =  |  (10.2) 

We  then  have 

xn(tb;[u])  =  xn(tb;[v])  +  £  >  x^t^jM)  (10.3) 

But  by  construction  we  also  have 

(e,^)  cb  (io.4) 

Relations  (10.2),  (10.3)  and  (10.4)  contradict  the  assumed  opti¬ 
mality  of  the  element  [v]  of  F*,  hence  x  =  x( t^ ; [  v] )  is  a  boundary 
point  of  the  set  H.  This  concludes  the  proof  of  Theorem  I. 

Proof  of  Theorem  II.  The  mapping  H-»H([v])  is  defined  by 

y  =  0(tb;[v])(x  -  x(tl5j[v]))  (10.5) 

But  G( t^j [v] )  is  the  identity  matrix,  hence  H([v])  is  a  simple  trans¬ 
lation  of  H.  This  translation  conserves  the  topological  properties 


of  the  points  in  H;  in  particular  to  a  boundary  point  of  H  corre 
sponds  a  boundary  point  of  H([v])  and  conversely. 


Proof  of  Theorem  III*  Let  us  assume  that  0  is  interior  to  H([v]) 
and  show  that  0  is  then  interior  to  H([v]). 

If  0  is  interior  to  H([v])  then  by  Proposition  (8.13)  we  know 
that  there  exists  a  subset  F([v])  of  F  and  two  constants  m  and  k 


such  that 

(i)  0  is  interior  to  {y(  t^;  [u] ,  [v] )  :  [u]  eF([v])}  (10.6) 

(ii)  the  mapping  F(  [v] ) ->  (y(  t^;  [u],  [v] )  :  [u]  eF([v])} 

is  one-to-one  (10.7) 

( iii)  d([u],[v])<k  for  all  [u]eF([v])  (10.8) 

(iv)  oQu^tUg])  <  m|y(  tfe;  [u;L] ,  [v] )  -  y(tb;  [Ug],  [v])  | 

for  all  [u^]  and  [Ug]  €  F([v])  (10. 9) 


Let  F([v],Q)  be  the  subset  of  F([v])  defined  by 

F([v],Q)=  {[u]  :  [u] e  F( [v]),a( [u],[v])  <  Q)  (10.10) 

For  all  [u]  e  F([v])  such  that 

|y(tb;[u],[v])|  <  ^  (10.11) 

we  have,  by  applying  ( 10.9) > 

a((u],[v])<Q  (10-12) 


which  implies 
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[u]  c  F([v],Q) 


(10.13) 


We  may  then  write 

(i)  0  is  interior  to  (y( tb; [u], [v] )  :  [u]  e F( [v],Q)}  (10.14) 

(ii)  the  mapping  F( [v],Q)  -*Cy( t^; [u], [v] )  :  [u]  € F( [v],Q)J 
is  one-to-one 

(iii)  d([u],[v])<k  for  all  [u]eF([v],Q)  (10. 16) 

(iv)  ff([u1],[u2])  <  -y(tb;[u2],[v])| 

for  all  [u^  and  [Ug]  €  F(  [v],Q)  (10.17) 

From  Proposition  (6.13)  we  then  have 

F([v],Q)CF*([v],k)CF*  (10.18) 

i  .e 

y(tb;[u],[v])  exists  for  all  [u]cF([v],Q)  (10.19) 

We  also  know  that 

(i)  the  mapping  from  {y( t^; [u], [v] )  :  [u]  eF([v],Q)}  to 

F([v],Q)  is  continuous  (see  relation  (10.17))  (10.20) 

(ii)  the  mapping  from  F([v],Q)  to  { y( t^ ; [u],  [v] ) :  [u]  eF([v],Q)} 
is  continuous  (see  Proposition  (6.13))  (10.21) 

Hence  the  mapping  from  (y( t^j [u], [v] )  :  [u]  eF([v],Q)}  to 
(y(tb;[u],[v)) :  [u] c F( ( v] , Q) )  is  continuous.  (10.22) 
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Fran  Proposition  (7-13)  we  know  that  there  exists  a  function  g(r), 
defined,  continuous  and  nondecreasing  over  [0,Q]  such  that 

(i)  lim  =  0  (10.23) 

r->  0 

(ii)  |y(tb;[u),(v])  -  y(tbj[u],[v])|  <  g(o([u],[v])) 

for  all  [u]eF([v],Q)  (10.210 

We  define  the  function  G(r)  by  the  relation 

G(r)  =  g(mr)  (10.25) 

From  relations  (10.17),  (10.25)  and  (10.24),  we  know  that  there  exists 
a  function  G(r),  defined,  continuous,  nondecreasing  over  [0,  and 
such  that 

(i)  lim  =  0  (10.26) 

r-*  0  r 

(ii)  | y( t^j [u],[v])  ”  y( t^ j [u ] , [ v] ) |  <  0( I y( t^ j [u],[v]) | )  (10.27) 

We  apply  Proposition  (9*1)  to  relations  (10.14),  (10.22),  (10.26) 
and  (10.27)  and  we  obtain 

0  is  interior  to  {y(t^;[u], [v])  :  [u]  €  F( (v],Q)}  (10.28) 

and  a  fortiori 

0  is  interior  to  H([v])  (10.29) 

This  concludes  the  proof  of  Theorem  III. 

+ 

Proof  of  Theorem  IV.  By  Proposition  (8.11),  the  set  H([v])  is  convex, 

+ 

hence  if  y  ■  0  is  a  bcundary  point  of  H([v])  there  exists  a  hyperplane 
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(K([v])|y)  =  0 


(10.30) 


such  that 

(*([v))|y)<0  for  all  ycH([v])  (10.31) 

Let  us  assume  that  there  is  a  [u]  €  F  such  that 

(*(M)|G(t;[v])(f(x(t;[v]),u(t),t)  -  f(  x(  t ;  [  v] ),  v(  t),  t) )  >  £  >  0 

(10.32) 

for  t  e  E,  where  S  e  5  with  n(E)  >  0,  then  by  introducing  the 
vector-valued  function 

U*(t)  =  v(t)  +  X(E)(u(t)  -  v(t))  all  t  €  [ta,tb]  (10.33) 

we  obtain 

(*(  [v])|y(tb;[u#],[v])>  >  Ci»(E)  >0  (10. 3^) 

We  also  have 

y(tb;[u*Uv])  e  H([v])  (10-35) 

since  [u*]  c  F.  Relations  (10.3l)>  (10.34)  and  (10.35)  are  contra¬ 
dictory.  This  concludes  the  proof  of  Theorem  IV. 

Proof  of  Theorem  V.  Let  the  vector  p(t;[v])  be  defined  by 

relation  (4.2).  This  vector  p(t;[v])  is  non identically  zero  and 

continuous  over  tta,t^]  since  G(t;[v])  and  G  1(t;[v])  exist  and 

are  bounded  over  [t  ,t.  ].  Relation  (4.1)  may  be  written  under  the 
a  d 

fora: 
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<*([v])|G(tj[v])(f(x(tj[v]),u(t),t)-  f(x(tj[v]),v(t),t))>  <  0 


for  all  [u]  in  F  and  a.e.  t  € 

or,  from  the  definition  of  a  transposed  matrix 

(GT(t;[v])*(  [v]) |f(x(tj[v]),u(t),t)  -  f(x(t;[v]),v(t),t)>  <  0 

for  all  [u]  in  F  and  a.e.  t  e  [t  ,t,  ] 

a  D 

i.e.,  from  relation  (4.2), 

(p(t;[v])|f(x(t;[v]),u(t),t)  -  f(x(t;[v]),v(t),t))  <  0 

for  all  [u]  in  F  and  a.e.  t  e  [t  ,t,  ] 

a  d 

This  proves  relation  (4.3)*  By  differentiation  withrespect  to 
relation  (4.2),  we  obtain 

p(t; [v])  =  (GT(  t; [v] ) ) "x( [v] ) 

=  (G(t;[v]))Tn([v])  a.e.  t  e  [t^t^ 

but  by  definition  (see  relation  (2.3))>  we  have 

G(t;[v])  *-G(t;[v])D(tj[v])  a.e.  t  e 

hence 

P(t;[v])  =  (  -G(t;[v])D{t;[v]))Tn([v]) 

“  -  DT(tj[v])GT(t;[v])n( [v]) 

*  '  DT(tj[v])p(t;[v]) 

This  proves  relation  (4.4)  and  concludes  the  proof  of  Theorem  V 


(10.J6) 

(10.57) 

(10.58) 
t  of 

(10.59) 

(10.40) 


(10.41) 


107 


Proof  of  Theorem  VI .  This  theorem  is  Just  a  logical  conclusion  of 


Theorems  II,  III,  IV  and  V:  if  the  point  x(t^;[v])  is  a  boundary 

point  of  the  set  H,  then  the  point  y  *  0  is  a  boundary  point  of  the 

set  H([v])  (see  Theorem  II),  then  the  point  y  =  0  is  a  boundary 
+ 

point  of  the  set  H([v])  (see  Theorem  III),  then  there  exists  a  non¬ 
zero  constant  vector  n([v])  such  that  condition  (4.1)  is  satisfied 
for  all  [u]  in  F  (see  Theorem  IV),  then  there  exists  a  vector 
p(t;[v])  continuous,  nonidentically  zero  on  [t^t^]  and  satisfying 
conditions  (4.3)  and  (4.4)  (see  Theorem  V).  This  concludes  the  proof 
of  Theorem  VI. 

Proof  of  Theorem  VII.  This  theorem  is  just  a  logical  conclusion  of 
Theorems  I  and  VI:  if  an  element  [v]  of  F*  is  optimal,  then  the 
point  x(t^;[v])  Is  a  boundary  point  of  the  set  H  (see  Theorem  I), 
then  there  exists  a  vector  p(t;[v])  continuous,  nonidentically  zero 
on  and  satisfying  conditions  (4.3)  and  (4.4)  (see  Theorem  VI). 

This  concludes  the  proof  of  Theorem  VII. 
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